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JYROS1NE PHOSPHORYLATED CLEAVAGE 
FITRROW-ASSOriATFn PROTEINS rPSTPIP^ 
FIELD OF THE INVENTION 

This present invention concerns novel proteins which interact with and are dephosphorylated by PEST- 
type protein tyrosine phosphatases. More particularly, the present invention concerns PSTPIP polypeptides 
which interact with the protein tyrosine phosphatase enzyme PTP HSCF and which are associated with the 
polymerization of actin monomers. 

BACKGROUND OF THE INVENTION 
The phosphorylation of tyrosine residues in eukaryotic proteins has been shown to play extremely 
important roles in the regulation of numerous eukaryotic cellular processes (Fantl et al., Annu. Rev. Biochem. 
62:453-481 (1993) and Hunter, 1001 Protein Kinases Redux Toward 2000 5:367-376 (1994)). While a great 
deal of information has been accumulated regarding the functions of the protein tyrosine kinases, far less is 
understood about the physiological roles of protein tyrosine phosphatases (PTPs), the enzymes which remove 
phosphate from tyrosine residues in proteins. While approximately 50 PTPs have now been described, the 
functions of only a very few are beginning to be understood (Tonks. Semin. Ceil Biol. 4:373-453 (1993) and 
Dixon. Recent Prog. Horm. Res. 51:405-414 (1996)). However, in general, it appears that many of the PTPs 
function to modulate the positive or negative signals induced by various protein tyrosine kinases. Therefore, 
it is likely that PTPs play critical roles in numerous and diverse cellular processes. 

The PEST family of PTPs are a group of phosphatase enzymes. The four known examples of these 
enzymes, PTP PEST [Yang et al., J. Bioi Chem. 268(23): 1 7650 ( 1 993)], PTP PEP [Matthews et al., MoL Cell 
Biol 12(5):2396-2405 (1992)], PTP HSCF [Cheng et al.. Blood 88(4): 1 156-1 167 (1996); USSN 08/620,526 
filed 22 March 1996]; also known asPTP-Kl [Huang et al.. Oncogene 13:1567-1573 (1996)], PTP20 [Aoki 
et aLJ. Biol Chem. 271(46):29422-29426 (1996)] orFLPl [Dosil et al., Blood 88(1 2):45 10-4525 (1996)] 
and PTP BDP1 (Kim et al., Oncogene 13:2275-2279 (1996)), all contain an N-terminal phosphatase domain 
which is followed by a variably sized region that is rich in proline, serine and threonine residues, but which 
has no obvious homology to other proteins. The PEST family of PTPs also contain a highly conserved 20 
amino acid long proline rich region at the very C-terminus of the proteins which are believed to be involved 
in protein-protein interactions. With regard to cell type expression, PTP PEST is ubiquitously expressed 
(Yang et al., (1993) supra ). PTP PEP is expressed in lymphdid cells (Matthews et al., (1992) supra ! PTP 
HSCF is expressed in hematopoietic stem/progenitor cells and fetal thymus (Cheng et al. (1996) supra and 
Dosil et al., (1996) supra ) as well as a subset of adult tissues including bone marrow (Huang et aL, (1996) 
supra ) and PTP BDP1 is expressed at low levels in the brain as well as other adult tissues (Kim et al., (1996) 
supra ). 

Insight into the physiological functions of PEST PTPs may be obtained from an examination of the 
proteins which interact with these enzymes, the effects of overexpression of the proteins on cellular 
differentiation and the possible modes of regulation of the molecules. Transfection of dominant negative forms 
of PTP PEST into COS cells results in an endogenous, hyperphosphorylated protein that has been identified 
as pi 30^ a cytoplasmic docking/adaptor-type molecule which contains an SH3 domain as well as several 
potential tyrosine phosphorylated SH2 binding sites (Garton et aL, Mol. Cell Biol 16(1 1):6408-6418 (1996)). 



83 77 



-1- 



WO 98/35037 PCT/US98/01774 

The function of pi 30*~AS j s incompletely understood, but it appears to be associated with focal adhesions 
and is phosphorylated by the pl25 FAK (Petch et al., J, Ceil. Set. 108:1371-1379 (1995) and the RAFTK 
(Astier et al., J. Biol. Chem. 272(1 ):228-232 (1997) tyrosine kinases, suggesting that it may play a role in 
integrin-mediated signal transduction. Because dominant negative PTP PEST inhibits dephosphorylation of 

5 pl30^- AS , it is likely that this phosphoprotein is a substrate for this PTP. 

Interestingly, it has also been recently shown that the PTB domain of the cytoplasmic adaptor protein 
SHC interacts with a non-phosphorylated PTB-related binding site in the C-terminal region PTP PEST 
(Charest et al, J. Biol. Chem. 271 (14): 8424-8429 (1996)). In addition, recent data have demonstrated that 
Csk, a cytoplasmic tyrosine kinase which inactivates Src family kinases by phosphorylation of their C-terminal 

10 inhibitory tyrosines, associates with the PEP PTP via an interaction between the Csk SH3 domain and one of 
the four proline-rich potential SH3 binding sites in the C-terminal region of the enzyme (Cloutier et al., Eh4BO 
J. 15(18):4909-4918 (1996)). Together, these results suggest that the biological activities of PTP PEST and 
PTP PEP (as well as possibly other PEST PTPs) are mediated through their interaction with critical 
cytoplasmic signaling proteins involved with the transmission of information from various cell surface 

15 receptors. 

However, it is believed that the PSTPIP proteins which bind to and are dephosphorylated by members 
of the PEST-type protein tyrosine phosphatases have not been heretofore disclosed. Therefore, it is an object 
of the present invention to provide PSTPIP polypeptides which bind to and are dephosphorylated by members 
of the PEST-type protein tyrosine phosphatases. 
20 It is a further object of the present invention to provide nucleic acid encoding the PSTPIP polypeptides 

so that those polypeptides may be prepared by recombinant DNA techniques. 

These and further objects will be apparent to the ordinarily skilled artisan upon consideration of the 
specification as a whole. 

SUMMARY OF THE INVENTION 

25 These objects are accomplished, in one aspect, by providing isolated PSTPIP polypeptides selected from 

the group consisting of: 

(i) a polypeptide comprising the amino acid sequence of the PSTPIP polypeptide shown in 
Fig. lA(SEQIDNO: 1); 

(ii) a further mammalian homologue of the polypeptide of (i); 

30 (iii) a polypeptide encoded by nucleic acid which hybridizes under stringent conditions to 

nucleic acid encoding the polypeptide of (i) or (ii) and which substantially retains the 
ability to bind to a member of the PEST-type protein tyrosine phosphatases; and 
(iv) a functional derivative of any of the polypeptides (i)-(iii) substantially retaining the ability 

to bind to a member of the PEST-type protein tyrosine phosphatases. 
35 In another aspect, the present invention provides antagonists of the above described PSTPIP 

polypeptides. 

In yet other aspects, the present invention provides isolated nucleic acid sequences which encode the 
above described PSTPIP polypeptides, vectors comprising those nucleic acid sequences operably linked to 
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control sequences recognized by host cells transformed with those vectors and host cells comprising the above 
described nucleic acid sequences. 

In yet other aspects, the present invention provides antibodies which are capable of binding to the 
above described PSTPIP polypeptides and hybridoma cell lines which produce such antibodies. In one 
5 embodiment, the antibodies are monoclonal antibodies. 

The present invention also provides a method for producing the above described PSTPIP 
polypeptides comprising transforming a host cell with nucleic acid encoding the polypeptide, culturing the 
transformed cell and recovering the polypeptide from the cell culture. 

In another embodiment, the present invention provides a method for inducing the polymerization 
1 0 of actin monomers in a eukaryotic cell comprising introducing the above described PSTPIP polypeptide into 
the cell. 

The present invention also provides an assay for identifying antagonists and agonists of the above 
described PSTPIP polypeptides comprising contacting the PSTPIP polypeptide with a candidate antagonist 
or agonist and monitoring the ability of the polypeptide to induce the polymerization of actin monomers. 
15 In yet another embodiment, the invention concerns an assay for identifying a polypeptide capable 

of interacting with a PST Phosphatase Interacting Protein (PSTPIP), comprising 

(a) expressing nucleic acid molecules encoding a polypeptide comprising a fusion of a native 
PSTPIP sequence or a fragment thereof to the DN A-binding domain of a transcriptional activator, and a fusion 
of a candidate polypeptide to the activation domain of a transcriptional activator, in a single host cell carrying 

20 a reporter gene; and 

(b) monitoring the association of said candidate polypeptide wit said native PSTPIP sequence 
or a fragment thereof by detecting a signal of the molecule encoded by said reporter gene. 

The present invention further concerns an assay for identifying peptides capable of inhibiting the 
interaction of a native PST Phosphatase Interacting Protein (PSTPIP) and a native protein tyrosine phosphatase 

25 hematopoietic stemcell fraction (PTP HSCF), which comprises contacting said PSPIP and a PTP HSCF, or 
fragments thereof, with a candidate peptide, and detecting the ability of the PTPPIP and PTP HSCF, of 
fragments thereof, to interact with each other 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1 A-1C. Protein Sequence and Putative Domain Structure of PSTPIP. A. Illustrated 

30 is a comparison of the protein sequences of murine PSTPIP (PSTPIP) (SEQ ID NO:l) and S. pom be cdcl5 
(cdcl5) (SEQ ID NO:26). The asterisks illustrate the conserved tyrosine residues and the shows the 
conserved potential SH3 binding site. The predicted coiled coil and SH3 domains are overlined. B. Sequence 
comparisons of the SH3 domains of PSTPIP (pstpip.sh3) (SEQ ID NO:3) and several different proteins known 
to interact with the cytoskeieton including myosin heavy chain (myosin.sh3) (SEQ ID NO:4), spectrin 

35 (spectrin.sh3) (SEQ ID NO:5), fodrin (fodrin.sh3) (SEQ ID NO:5), hematopoietic specific protein 1 (hsp.sh3) 
(SEQ ID NO:6) and cortactin (cortactin.sh3) (SEQ ID NO:7). C. Domain structure of PSTPIP and cdcl5p. 
Illustrated are the predicted coiled coil regions containing regions rich in basic and acidic residues (+-+), the 
conserved tyrosine residues (*), the conserved potential SH3 binding site (t) and the conserved SH3 domains. 
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Also, shown is the large region in the S. pombe protein which contains predicted PEST degradation signals 
and which is missing from the mammalian homologue. 

Figures 2A-2B. Northern Blot Analysis of the Expression f PSTPIP Transcript. A. 
Expression of PSTPIP and actin in heart (lane a), brain (lane b), spleen (lane c), lung (lane d), liver (lane e), 
5 muscle (lane f)* kidney (lane g) and testis (lane h). B. Expression of PSTPIP and actin in 7 day murine 
embryos (lane a), 1 1 day murine embryos (lane b), 15 day ( murine embryos (lane c) and 17 day murine 
embryos (lane d). 

Figure 3. Interaction Between PTP HSCF and GST-PSTPIP. Shown are precipitations of in 
vitro transcribed and translated PTP HSCF phosphatase with GST-p85 (lane a), GST alone (lane b), GST-Src 

10 (lane c), GST-Grb-2 (lane d), GST-PSTPIP (lane e), GST-AbI (lane 0, GST-PLC (lane g), anti-PTP HSCF 
polyclonal antibody (lane h) and GST-Spectrin (lane i). 

Figures 4A-4D. Mapping of the PSTPIP Interaction Site on PTP HSCF. A. Shown are PTP 
HSCF constructs containing full-length, C-terminal homology (CTH) and PST-rich domain deletions used for 
in vitro transcription and translation. B. Precipitation of in vitro transcribed and translated forms of PTP HSCF 

1 5 with GST-PSTPIP or anti-PTP HSCF polyclonal antibody. Lanes are designated as follows: full-length PTP 
HSCF with anti-PTP HSCF (lane a), full-length PTP HSCF with GST-PSTPIP (lane b), PST-rich+CTH deleted 
PTP HSCF with anti-PTP HSCF (lane c), PST-rich+CTH deleted PTP HSCF with GST-PSTPIP (lane d), PST- 
rich+CTH deleted PTP HSCF with GST-Spectrin (lane e), CTH-deleted PTP HSCF with GST-Spectrin (lane 
f), CTH-deleted PTP HSCF with GST-PSTPIP (lane g), CTH-deleted PTP HSCF with anti-PTP HSCF (lane 

20 h). full-length PTP HSCF with anti-PTP HSCF (lane i). C. Precipitation of//? vitro transcribed and translated 
PSTPIP with anti-PSTPIP polyclonal antibody (lane a), 10 ug of GST-PST-rich+CTH PTP HSCF (a GST 
construct containing the PST-rich and CTH domains of the phosphatase) (lane b), 5 ug of GST-PST-rich+CTH 
PTP HSCF (lane c), 2 ug of GST-PST-rich+CTH PTP HSCF (lane d) or 1 ug of GST-PST-rich+CTH PTP 
HSCF (lane e). D. Precipitation of in vitro transcribed and translated PSTPIP with GST-PST-rich+CTH PTP 

25 HSCF in the presence of increasing amounts of proline rich peptides derived from the C-terminal homology 
regions of PTPs HSCF, PEST and PEP or a control proline rich peptide from PTP HSCF. 

Figures 5A-5B. Mapping of PTP HSCF Interaction Site on PSTPIP. A. Shown are GST 
fusions containing the full-length, coiled coil and SH3 domains of PSTPIP. B. Precipitation of full-length 
PTP HSCF with GST- full-length PSTPIP (lane a), anti-herriaggiutinin (directed against a hemagglutinin 

30 epitope tag at the N-terminus of the PTP HSCF) (lane b), GST-Grb2 (lane c), GST-Spectrin (lane d), GST-full- 
length PSTPIP (lane e), GST-SH3 PSTPIP (lane f) and GST-coiled coil PSTPIP (lane g). 

Figures 6A-6F. In vivo Tyrosine Phosphorylation of PSTPIP. A. Illustrated is the 
immunoprecipitation of endogenous PSTPIP from Baf3 cells with anti-PSTPIP polyclonal antibody in the 
presence and absence of the PTP inhibitor pervanadate. Precipitates were blotted with either anti-PSTPIP 

35 (a PSTPIP) or anti-phosphotyrosine (a PTyr) antibodies. Note that the protein in the absence of pervanadate 
is more diffuse and shows a lower phosphotyrosine content than the protein in the presence of the inhibitor. 
B. Shown are immunoprecipitations done with the indicated antibodies on cells transfected as illustrated. C. 
Immunoprecipitation of PSTPIP with anti-FLAG antibody (a FLAG) directed against a C-terminal PSTPIP 
FLAG epitope and blotting with anti-phosphotyrosine antibodies (a P-Tyr) . D. Immunoprecipitation of 
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PSTPIP with anti-FLAG antibody (a FLAG) and blotting with anti-FLAG (a FLAG), The absence of visible 
protein in the lanes containing tyrosine phosphorylated PSTPIP may be due to phosphorylation of the tyrosine 
in the FLAG epitope. The proteins are clearly visible in the anti-phosphotyrosine blot, however. E. 
Immunoprecipitation of PTP HSCF with anti-HA antibody (a HA) directed against an N-terminal 
5 hemagglutinin epitope and blotting with the same antibody. F. Shown are coprecipitation experiments 
demonstrating that precipitation of PSTPIP (anti-FLAG tagged) brings down PTP HSCF (anti-HA tagged) and 
precipitation of PTP HSCF (anti-HA tagged) brings down PSTPIP (anti-FLAG tagged). 

Figure 7. Localization of Endogenous PSTPIP in 3T3 Cells. Shown are confocal images of two 
different groups of 3T3 cells viewed at different focal planes stained with anti-PSTPIP antibody (Cy3) and 

10 phalloidin-FITC (panels a-d). Sites of colocalization appear yellow and are the cortical actin (c.a.), the 
lamellipodia (lam.) and the stress fibers (s.f.). Panels e-g illustrate a lower magnification and two high 
magnification views, respectively, of interphase cells and cells undergoing cytokinesis stained with the same 
reagents. The interphase cells show co-localization predominantly in the cortical actin (c.a.) region at this 
focal plane, while the cells undergoing cytokinesis show colocalization predominantly at the cleavage furrow 

1 5 (c.f.) at both focal planes shown. The bars show sizes in microns. 

Figure 8. Expression of PSTPIP in Transfected 3T3 Cells. Panel a shows a group of 3T3 cells 
transfected with an expression plasmid containing a C-terminal FLAG version of PSTPIP under the control 
of the cytomegalovirus promoter. Cells were stained with anti-FLAG (Cy3) and phalloidin-FITC. PSTPIP 
co-locaiizes with actin at the conical region (c.a.), the stress fibers (s.f.) and the lamellipodia (lam.)- Panels 

20 b and c illustrate two cells with abnormal morphology expressing PSTPIP. Note that these fiiopodial 
structures are greater than 100 microns in length. Panel c also illustrates that these cells have a different 
morphology from normally elongated 3T3 cells. 

Figure 9. N Terminal Truncations Result in a Loss of PST PIP Binding to PTP HSCF. Full 
length PST PIP (Spencer et ai, J. Cell Biol., 138(4):845-860 (1997)) and forms missing 25 (delta 25), 50 

25 (delta 50) and 75 (delta 75) amino acids from the N-terminus were transcribed and translated in vitro. The 
upper panel illustrates immunoprecipitation of the proteins with an antibody directed against a PST PIP C- 
terminal FLAG epitope. The lower panel illustrates the same proteins precipitated with a GST fusion protein 
containing the C-terminal 149 amino acids of PTP HSCF (Cheng et ai, Oncogene 13:2275-2279 (1996)) 
including the C-terminal proline rich binding site. 

30 Figure 10. Confocal Laser Scanning Microscopy of Various Forms of PST PIP Expressed in 

Chinese Hamster Ovary (CHO) Cells. CHO cells were transfected with piasmids expressing various forms 
of PST PIP, and the cells were subsequently stained with a rhodamine-conjugated monoclonal antibody 
directed against a PST PIP C-terminal FLAG epitope. Cells were counter-stained with FITC-conjugated 
phalloidin to illuminate F-actin. A. Wild type (full length) PST PIP, B. PST PIP missing the N-terminal 25 

35 amino acids, C. PST PIP missing the N-terminal 50 amino acids, D. PST PIP missing the N-terminal 75 amino 
acids, E. PST PIP with an alanine replacement mutation at tryptophan232. 

Figure 1 1. W232A in PST PIP abolishes the PTP HSCF interaction in vitro. Wild type and 
mutant forms of PST PIP were transcribed and translated in vitro. The upper panel illustrates 
immunoprecipitation of the proteins with an antibody directed against a PST PIP C-terminal FLAG epitope. 
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The lower panel illustrates the same proteins precipitated with a GST fusion protein containing the C-terminal 
149 amino acids of PTP HSCF including the C-terminal proline rich binding site. The W232A mutation in 
abolishes the interaction between PST PIP and PTP HSCF, while the other mutations, which were selected 
because of their similarity to those found in WW-type domains (Chen etaL, J. Biol Chem. 272(27): 17070- 
5 17077 (1997)) have little effect on binding. 

Figure 12. In vivo analysis of W232A mutant PST PIP interactions with wild type and 
dominant negative ("substrate trapping") forms of PTP HSCF. COS cell transfections were performed 
with plasmids encoding the proteins indicated at the top of the figure. Cell lysates were immunoprecipitated 
with anti-FLAG antibody (specific for PST PIP) or anti-HA antibody (specific for PTP HSCF). The resultant 
10 precipitates were resolved on SDS polyacrylamide gels and probed with anti-FLAG antibody (to detect PST 
PIP), anti-HA antibody (to detect PTP HSCF) and anti-pY (anti-phosphotyrosine antibody) to detect this 
modified residue on these proteins. Note that the W232A mutant of PST PIP is neither coprecipitated with 
PTP HSCF nor is it "substrate trapped" (as determined by hyperphosphorylation) by the dominant negative 
(C-S) form of PTP HSCF, while the wild type form of the protein (PST PIP wt) is found in a complex with 
15 precipitated PTP HSCF and is hyperphosphoryiated ("substrate trapped" (Jia et al. Science 268(521 8): 1754- 
1758 (1995); Garton et al, Mol Cell Biol 16(1 1):6408-6418 (1996); Flint ei al, Proc. Natl Acad. Sci USA 
94(5): 1680- 1685 (1997); Spencer el al (1997), supra )) by dominant negative (C-S) PTP HSCF. 

Figure 13. Alanine scanning mutagenesis of the C-terminally derived PTP HSCF peptide. 
Peptides derived from the C-terminus of PTP HSCF with alanines at each of the indicated positions were 
20 analyzed for their ability to inhibit the interaction between in vitro transcribed and translated PST PIP and the 
GST PTP HSCF fusion protein. Inhibition of the interaction results in a loss of binding to the GST fusion and 
a lack of signal on the gel, while peptides with decreased inhibitory activity are poorly competitive and allow 
for binding. Also illustrated is the sequence of this C-terminal region in the peptide used for this analysis, 
PTP HSCF (Cheng et al (1996), supra ) as well as PTP PEST (Garton and Tonks, EMBOJ. 13(16): 3763- 
25 3771 (1994)) and PTP PEP (Matthews et al (1992), supra ). 

Figure 14. In vitro and in vivo analysis of mutations of the C-terminus of PTP HSCF. A/1. 
PTP HSCF constructs containing alanine substitutions at the indicated positions were transcribed and translated 
vitro, and the resultant mutant proteins were precipitated with either 1 (top panel) or 10 (second panel) ug/ml 
GST PTP PIP. The third panel illustrates immunoprecipitations of the in vitro transcribed and translated PTP 
30 HSCF mutants with a monoclonal antibody directed against an N-terrninal HA tag to insure that all the mutants 
were produced. Multiple bands precipitated with the GST PST PIP fusion protein are apparently C-terminal 
proteolyzed products of the PTP. Also shown are GST PST PIP precipitations (10 ug/ml) performed on a 
double mutant of PTP HSCF (R+W) with both residues R444 and W450 mutated to alanine (bottom panel Fig. 
14 A/2). B. COS cells were co-transfected with a 10:1 ratio of plasmids encoding hemagglutinin (HA) tagged 
35 PTP HSCF with the illustrated alanine substitutions or a mutant of PTP HSCF deleted for the C-terminal 24 
amino acids (PTP HSCFD24) and wild type PST PIP with a C-terminal FLAG epitope tag, respectively. 
Transfected cell lysates were immunoprecipitated with anti-HA monoclonal antibodies and the precipitates 
were blotted with anti-PST PIP polyclonal antibody to detect the relative amounts of PST PIP complexed with 
either wild type or different mutant forms of PTP HSCF. Lysates were also immunoprecipitaed with anti-HA 
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antibody and blotted with the same antibody to insure equal expression of PTP HSCF. Equal expression of 
PSTPIP was determined by immunoprecipitating lysates with anti-FLAG monoclonal antibody and blotting 
with anti-PST PIP polyclonal antibody. Note the complete loss of coprecipitation in both the PTP HSCFD24 
and PTP HSCF444+450 mutants. 
5 Figure 15- W232A PST PIP is tyrosine phosphorylated more efficiently in the presence of v- 

Src. Equivalent amounts of plasmids encoding either wild type or W232A mutant PST PIP were transfected 
into COS cells in the presence of increasing quantities of a plasmid encoding the v-Src tyrosine kinase. PST 
PIP was immunoprecipitated with anti-FLAG antibody and blotted with either a polyclonal antibody directed 
against PST PIP (top panel) or an anti-phosphotyrosine antibody (bottom panel). Densitometric analysis of 
1 0 the top panel revealed —1 0-15% differences in signals obtained between the wild type or W232A mutant forms 
of PST PIP, while densitometry of the bottom panels showed 2-3 fold higher levels of phosphotyrosine in the 
mutant form of the protein (data not shown), depending upon the dose of v-Src plasmid added. 

DETAILED DESCRIPTION OF THE INVENTION 

A. Definitions 

1 5 The phrase TSTPIP polypeptide". "PSTPIP", "PST Phosphatase Interacting Protein' 1 and "PTP 

HSCF interacting protein" are used interchangeably and refer to a polypeptide which comprises the amino acid 
sequence of the PSTPIP polypeptide shown in Fig. 1 A (SEQ ID NO:l) or a further mammalian homologue 
thereof. The above terms are also intended to encompass functional polypeptides encoded by nucleic acid 
which hybridizes under stringent conditions to nucleic acid which encodes a polypeptide comprising the amino 

20 acid sequence of the PSTPIP polypeptide shown in Fig. 1A (SEQ ID NO:2) or a further mammalian 
homologue thereof as well as functional derivatives of any of the above polypeptides. 

By "further mammalian homologue" or grammatical equivalents thereof is meant a PSTPIP 
polypeptide from a mammalian species other than murine which is functionally similar to the PSTPIP 
polypeptide shown in Fig. 1 A (SEQ ID NO:1). Such PSTPIP homologues may be identified in such mammals 

25 as, for example, human, rabbit, rat, porcine, non-human primates, equine and ovine. Screening cDNA libraries 
prepared from these mammals with a probe derived from the nucleic acid encoding the murine PSTPIP 
polypeptide shown in Fig 1 A (SEQ ID NO:2) will allow identification of such homologues, such as the human 
homologue (SEQ ID NOS.: 28 and 29). 

The term "native PSTPIP polypeptide" in this context refers to a naturally occurring PSTPIP 

30 polypeptide, having the described properties, of any human or non-human animal species, with or without the 
initiating methionine, whether purified from the native source, synthesized, produced by recombinant DNA 
technology or by any combination of these and/or other methods. Native PSTPIP polypeptide specifically 
includes the native murine PSTPIP protein shown in Fig. 1 A (SEQ ID NO:l), and the native human PST PIP 
protein (SEQ ID NO:29). 

35 A "functional derivative" of a polypeptide is a compound having a qualitative biological activity 

in common with the native polypeptide. Thus, a functional derivative of a native PSTPIP polypeptide is a 
compound that has a qualitative biological activity in common with a native PSTPIP polypeptide, for example, 
as being capable of binding to a member of the PEST-type protein tyrosine phosphatase family and/or being 
dephosphorylated by a member of the PEST-type protein tyrosine phosphatases when having at least one 
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phosphorylated tyrosine residue and/or associating with actin. "Functional derivatives" include, but are not 
limited to, fragments of native polypeptides from any animal species (including humans), derivatives of native 
(human and non-human) polypeptides and their fragments, glycosyiation variants of a native polypeptide, and 
peptide and non-peptide analogs of native polypeptides, provided that they have a biological activity in 
5 common with a respective native polypeptide. "Fragments" comprise regions within the sequence of a mature 
native polypeptide. The term "derivative" is used to define amino acid sequence variants (insertional, 
deletional and substitutional), and covalent modifications of a native polypeptide. "Non-peptide analogs" are 
organic compounds which display substantially the same surface as peptide analogs of the native polypeptides. 
Thus, the non-peptide analogs of the native PSTPIP polypeptide of the present invention are organic 

10 compounds which display substantially the same surface as peptide analogs of the native PSTPIP. Such 
compounds interact with other molecules in a similar fashion as the peptide analogs, and mimic a biological 
activity of a native PSTPIP of the present invention. The polypeptide functional derivatives of the native 
PSTPIP of the present invention preferably have at least about 65%, more preferably at least about 75%, even 
more preferably at least about 85%, most preferably at least about 95% overall sequence homology with the 

15 PSTPIP amino acid sequence shown in Fig. 1 A (SEQ ID NO:l) and substantially retain the ability to bind to 
a member of the PEST-type protein tyrosine phosphatases. 

The term "biological activity" in the context of the definition of functional derivatives is defined 
as the possession of at least one physiological function qualitatively in common with a native polypeptide. 
The functional derivatives of the native PSTPIP of the present invention are unified by their qualitative ability 

20 to bind to a member of the PEST-type protein tyrosine phosphatases. 

By "PEST-type protein tyrosine phosphatase" is meant a protein tyrosine phosphatase enzyme 
which possesses a non-catalytic domain comprising a variable sized region that is rich in proline, serine and 
threonine residues and a C-terminal 20 amino acid segment which is rich in proline residues and which defines 
at least one potential SH3 binding domain [Pawson, Nature 373:573-580 (1995)]. Included within the PEST- 

25 type protein tyrosine phosphatase family are the protein tyrosine phosphatases PTP PEST [Yang et aL ( 1 993) 
suora l. PTP PEP [Matthews et aL (1992) supra ]. PTP HSCF [Cheng et aL ( 1996) supra ]: also known as PTP- 
Kl [Huang et aL (1996) supra], PTP20 [Aoki et aL, (1996) supra] or FLP1 [Dosil et aL, (1996) supra ] and 
PTP BDP1 [Kim et aL, (1996) supra]. 

The term "agonist" is used to refer to peptide arid non-peptide analogs of the native PSTPIP 

30 polypeptides of the present invention and to antibodies specifically binding native PSTPIP provided that they 
retain at least one biological activity of a native PSTPIP. Preferably, the agonists of the present invention 
retain the qualitative ability to bind to a member of the PEST-type protein tyrosine phosphatases and/or induce 
the polymerization of actin monomers. 

The term "antagonist" is used to refer to a molecule inhibiting a biological activity of a native 

35 PSTPIP polypeptide of the present invention. Preferably, the antagonists herein inhibit the ability of the 
PSTPIP polypeptide of the present invention to bind to members of the PEST-type protein tyrosine 
phosphatase enzymes. It is also preferred that antagonists inhibit the ability of the PSTPIP polypeptide to 
induce the polymerization of actin monomers. 
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Agonist and antagonist candidates may comprise a variety of different compounds including 
peptides, proteins, organic molecules, and the like. For example, it is well within the skill level in the art to 
prepare combinatorial oligopeptide libraries and screen those libraries for members which either bind to the 
PSTPIP polypeptide or which interfere with the binding of a PSTPIP polypeptide to a member of the PEST- 

5 type protein tyrosine phosphatases. 

"Identity" or "homology" with respect to a native polypeptide and its functional derivative is 
defined herein as the percentage of amino acid residues in the candidate sequence that are identical with the 
residues of a corresponding native polypeptide, after aligning the sequences and introducing gaps, if necessary, 
to achieve the maximum percent homology, and not considering any conservative substitutions as part of the 

0 sequence identity. Neither N- or C-terminal extensions nor insertions shall be construed as reducing identity 
or homology. Methods and computer programs for the alignment are well known in the art. 

Ordinarily, the terms "amino acid" and "amino acids" refer to all naturally occurring L-oc-amino 
acids. In some embodiments, however, D-amino acids may be present in the polypeptides or peptides of the 
present invention in order to facilitate conformational restriction. For example, in order to facilitate disulfide 

15 bond formation and stability, a D amino acid cysteine may be provided at one or both termini of a peptide 
functional derivative or peptide antagonist of the native PSTPIP polypeptide of the present invention. The 
amino acids are identified by either the single-letter or three-letter designations: 





Asp 


D 


aspartic acid 


He 


I 


isoleucine 




Thr 


T 


threonine 


Leu 


L 


leucine 
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Ser 


S 


serine 


Tyr 


Y 


tyrosine 




Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 




Pro 


P 


proline 


His 


H 


histidine 




Gly 


G 


glycine 


Lys 


K 


lysine 




Ala 


A 


alanine 


Arg 


R 


arginine 
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Cys 


C 


cysteine 


Trp 


W 


tryptophan 




Val 


V 


valine 


Gin 


Q 


glutamine 




Met 


M 


methionine 


Asn 


N 


asparagine 



These amino acids may be classified according to the chemical composition and properties of their 
side chains. They are broadly classified into two groups, charged and uncharged. Each of these groups is 
30 divided into subgroups to classify the amino acids more accurately: 

I. Charged Amino Acids 

Acidic Residues : aspartic acid, glutamic acid 
Basic Residues : lysine, arginine, histidine 

II. Uncharged Amino Acids 

35 Hvdrophilic Residues : serine, threonine, asparagine, glutamine 

Aliphatic Residues : glycine, alanine, valine, leucine, isoleucine 
Non-polar Residues : cysteine, methionine, proline 
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Aromatic Residues : phenylalanine, tyrosine, tryptophan 

The term "amino acid sequence variant" refers to molecules with some differences in their amino 
acid sequences as compared to a native amino acid sequence. 

Substitutional variants are those that have at least one amino acid residue in a native sequence 
removed and a different amino acid inserted in its place at the same position. The substitutions may be single, 
where only one amino acid in the molecule has been substituted, or they may be multiple, where two or more 
amino acids have been substituted in the same molecule. 

lnsertional variants are those with one or more amino acids inserted immediately adjacent to an 
amino acid at a particular position in a native sequence. Immediately adjacent to an amino acid means 
connected to either the a-carboxy or <x -amino functional group of the amino acid. 

Deletional variants are those with one or more amino acids in the native amino acid sequence 
removed. Ordinarily, deletional variants will have one or two amino acids deleted in a particular region of 
the molecule. 

"Antibodies (Abs)" and "immunoglobulins (Igs) M are glycoproteins having the same structural 
characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include 
both antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter 
kind are, for example, produced at low levels by the lymph system and at increased levels by myelomas. 

Native antibodies and immunoglobulins are usually heterotetrameric glycoproteins of about 
150,000 daltons, composed of two identical light (L) chains and two identical heavy (H) chains. Each light 
chain is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies 
between the heavy chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly 
spaced intrachain disulfide bridges. Each heavy chain has at one end a variable domain (V H ) followed by a 
number of constant domains. Each light chain has a variable domain at one and (V L ) and a constant domain 
at its other end: the constant domain of the light chain is aligned with the first constant domain of the heavy 
chain, and the light chain variable domain is aligned with the variable domain of the heavy chain. Particular 
amino acid residues are believed to form an interface between the light and heavy chain variable domains 
(Clothia et al„ J. MoL Biol. 1 86:65 1-663 (1 985) and Novotny and Haber, Proc. NatL Acad ScL USA 82:4592- 
4596(1985)). 

The term "variable" refers to the fact that certain portions of the variable domains differ extensively 
in sequence among antibodies and are used in the binding and specificity of each particular antibody for its 
particular antigen. However, the variability is not evenly distributed through the variable domains of 
antibodies. It is concentrated in three segments called complementarity determining regions (CDRs) or 
hypervariable regions both in the light chain and the heavy chain variable domains. The more highly 
conserved portions of variable domains are called the framework (FR). The variable domains of native heavy 
and light chains each comprise four FR regions, largely adopting a P-sheet configuration, connected by three 
CDRs, which form loops connecting, and in some cases forming part of, the P-sheet structure. The CDRs in 
each chain are held together in close proximity by the FR regions and, with the CDRs from the other chain, 
contribute to the formation of the antigen binding site of antibodies (see Kabat et al., Sequences of Proteins 
of Immunological Interest, National Institute of Health, Bethesda, MD (1991)). The constant domains are not 
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involved directly in binding an antibody to an antigen, but exhibit various effector functions, such as 
participation of the antibody in antibody-dependent cellular toxicity. 

Papain digestion of antibodies produces two identical antigen binding fragments, called Fab 
fragments, each with a single antigen binding site, and a residual "Fc" fragment, whose name reflects its ability 
5 to crystallize readily. Pepsin treatment yields an F(ab') 2 fragment that has two antigen combining sites and 
is still capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen recognition and binding 
site. This region consists of a dimer of one heavy and one light chain variable domain in tight, non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to define an antigen 
1 0 binding site on the surface of the V H -V L dimer. Collectively, the six CDRs confer antigen binding specificity 
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs 
specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the 
entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
I 5 (CH 1) of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab* in which the cysteine residue(s) of the constant domains 
bear a free thiol group. F(ab') 2 antibody fragments originally were produced as pairs of Fab' fragments which 
have hinge cysteines between them. Other, chemical couplings of antibody fragments are also known. 
20 The light chains of antibodies (immunoglobulins) from any vertebrate species can be assigned to 

one of two clearly distinct types, called kappa (k) and lambda (A), based on the amino acid sequences of their 
constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, 

25 IgD, IgE, IgG and IgM, and several of these may be further divided into subclasses (isotypes), e.g. IgG-1, IgG- 
2, IgG-3. and IgG-4; IgA-l and lgA-2. The heavy chain constant domains that correspond to the different 
classes of immunoglobulins are called a, delta, epsilon, y, and u, respectively. The subunit structures and 
three-dimensional configurations of different classes of immunoglobulins are well known. 

The term "antibody" is used in the broadest sense and specifically covers single monoclonal 

30 antibodies (including agonist and antagonist antibodies), antibody compositions with polyepitopic specificity, 
as well as antibody fragments (e.g., Fab, F(ab') 2 , and Fv), so long as they exhibit the desired biological 
activity. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population 
of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
35 except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal 
antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to 
conventional (polyclonal) antibody preparations which typically include different antibodies directed against 
different determinants (epitopes), each monoclonal antibody is directed against a single determinant on the 
antigen. In addition to their specificity, the monoclonal antibodies are advantageous in that they are 
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synthesized by the hybridoma culture, uncontaminated by other immunoglobulins. The modifier 
"monoclonal" indicates the character of the antibody as being obtained from a substantially homogeneous 
population of antibodies, and is not to be construed as requiring production of the antibody by any particular 
method. For example, the monoclonal antibodies to be used in accordance with the present invention may be 

5 made by the hybridoma method first described by Kohler & Milstein, Nature 256:495 (1975), or may be made 
by recombinant DNA methods (see, e.g. U.S. Patent No. 4,816,567 by Cabilly et al.). 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) 
in which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences 
in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while 

1 0 the remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies derived 
from another species or belonging to another antibody class or subclass, as well as fragments of such 
antibodies, so long as they exhibit the desired biological activity (U.S. Patent No. 4,8 16,567 by Cabilly et al.; 
Morrison et al., Proc. Natl. Acad. Sci. USA 81:6851-6855 (1984)). 

"Humanized" forms of non-human (e.g. murine) antibodies are chimeric immunoglobulins. 

15 immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. For 
the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues from 
a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 

20 capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Furthermore, humanized antibody may comprise residues which are 
found neither in the recipient antibody nor in the imported CDR or framework sequences. These modifications 
are made to further refine and optimize antibody performance. In general, the humanized antibody will 
comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all 

25 of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR 
regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also 
will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin. For further details see: Jones et al., Nature 321 :522-525 (1986); Reichmann et al., Nature 
332:323-329 1988; EP-B-239 400 published 30 September 1987; Presta, Curr. Op. Struct. Biol. 2:593-596 

30 (1992); and EP-B-451 216 published 24 January 1996. 

In the context of the present invention the expressions "cell", "cell line", and "cell culture" are used 
interchangeably, and alt such designations include progeny. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the 
same function or biological property, as screened for in the originally transformed cell, are included. 

35 The terms "replicable expression vector" and "expression vector" refer to a piece of DNA, usually 

double-stranded, which may have inserted into it a piece of foreign DNA. Foreign DNA is defined as 
heterologous DNA, which is DNA not naturally found in the host cell. The vector is used to transport the 
foreign or heterologous DNA into a suitable host cell. Once in the host cell, the vector can replicate 
independently of the host chromosomal DNA, and several copies of the vector and its inserted (foreign) DNA 
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may be generated. In addition, the vector contains the necessary elements that permit translating the foreign 
DNA into a polypeptide. Many molecules of the polypeptide encoded by the foreign DN A can thus be rapidly 
synthesized. 

The term "control sequences" refers to DNA.sequences necessary for the expression of an operably 
5 linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, include a promoter, optionally an operator sequence, a ribosome binding site, and possibly, other 
as yet poorly understood sequences. Eukaryotic cells are known to utilize promoters, polyadenylation signals, 
and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another 

10 nucleic acid sequence. For example, DNA for a presequence or a secretory leader is operably linked to DNA 
for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; 
or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous and, in 

15 the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, then 
synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice. 

"Immunoadhesins" or "PSTPIP - immunoglobulin chimeras" are chimeric antibody-like molecules 
that combine the functional domain(s) of a binding protein (usually a receptor, a cell-adhesion molecule or 

20 a ligand) with the an immunoglobulin sequence. The most common example of this type of fusion protein 
combines the hinge and Fc regions of an immunoglobulin (Ig) with domains of a protein that recognizes and 
binds to a specific ligand. This type of molecule is called an "immunoadhesin", because it combines 
"immune" and "adhesion" functions; other frequently used names are "Ig-chimera", "Ig-" or "Fc-fusion 
protein", or "receptor-globulin." 

25 "Oligonucleotides" are short length, single- or double-stranded poiydeoxynucleotides that are 

chemically synthesized by known methods such as phosphotriester, phosphite, or phosphoramidite chemistry, 
using solid phase techniques such as those described in EP 266,032, published 4 May 1988, or via 
deoxynucleoside H-phosphonate intermediates as described by Froehler et al., Nuci. Acids Res. 14:5399 
(1 986). They are then purified on po ly aery lam ide gels. 

30 Hybridization is preferably performed under "stringent conditions" which means (1) employing low 

ionic strength and high temperature for washing, for example, 0.015 sodium chloride/0.0015 M sodium 
citrate/0.1% sodium dodecyl sulfate at 50° C, or (2) employing during hybridization a denaturing agent, such 
as formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% 
polyvinylpyrrolidone/50 nM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM 

35 sodium citrate at 42°C. Another example is use of 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium 
citrate), 50 mM sodium phosphate (pH 6/8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated 
salmon sperm DNA (50 ug/rnl), 0. 1 % SDS, and 1 0% dextran sulfate at 42° C, with washes at 42° C in 0.2 x 
SSC and 0.1% SDS. Yet another example is hybridization using a buffer of 10% dextran sulfate, 2 x SSC 
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(sodium chloride/sodium citrate) and 50% formamide at 55° C, followed by a high-stringency wash consisting 
of 0. 1 x SSC containing EDTA at 55° C. 

"Transformation" means introducing DNA into an organism so that the DNA is replicable, either 
as an extrachromosomal element or by chromosomal integration. Depending on the host cell used, 
5 transformation is done using standard techniques appropriate to such cells. The calcium treatment employing 
calcium chloride, as described by Cohen, Proc. Natl. Acad. ScL USA 69:21 10 (1972) and Mandel et al., J. Mol 
Biol. 53:154 (1970), is generally used for prokaryotes or other cells that contain substantial cell-wall barriers. 
For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van 
der Eb, Virology 52:456-457 (1978) is preferred. General aspects of mammalian cell host system 
10 transformations have been described by Axel in U.S. Pat. No. 4,399,216 issued August 16, 1983. 
Transformations into yeast are typically carried out according to the method of Van Solingen et aL J. Bact. 
130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. USA 76:3829 (1979). However, other methods for 
introducing DNA into cells such as by nuclear injection, electroporation or by protoplast fusion may also be 
used. 

15 "Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation 

of the digest on polyacrylamide or agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular weight, removal of the 
gel section containing the desired fragment, and separation of the gel from DNA. This procedure is known 
generally. For example, see Lawn et al., Nucleic Acids Res. 9:6103-61 14 (1981) and Goeddel et al.. Nucleic 

20 Acids Res. 8:4057 (1980). 

"Ligation" refers to the process of forming phosphodiester bonds between two double stranded 
nucleic acid fragments (Maniatis et aL, (1982) supra ). Unless otherwise provided, ligation may be 
accomplished using known buffers and conditions with 10 units of T4 DNA ligase ("ligase") per 0.5 mg of 
approximately equimoiar amounts of the DNA fragments to be ligated. 

25 "Preparation" of DNA from transformants means isolating plasrnid DNA from microbial culture. 

Unless otherwise provided, the alkaline/SDS method of Maniatis et al., (1982) supra , may be used. 
B. Production of PSTPIP Polypeptides bv Recombinant DNA Technology 

1. Identification and Isolation of Nucleic Acid Encoding PSTPIP 

Nucleic acids encoding the native PSTPIP proteins of the present invention may be isolated from 

30 cDNA or genomic libraries. For example, a suitable cDNA library can be constructed by obtaining 
polyadenylated mRNA from cells known to express the desired PSTPIP protein (for example Baf3, available 
through the American Type Culture Collection), and using the mRNA as a template to synthesize double 
stranded cDNA. mRNA encoding the native PSTPIP of the present invention is expressed, for example, in 
tissues derived from adult lung and spleen as well as in very early 7 day murine embryos. The gene encoding 

35 the novel PSTPIP polypeptide of the present invention can also be obtained from a genomic library, such as 
a human genomic cosmid library, or a mouse-derived embryonic cell (ES) genomic library. 

Libraries, either cDNA or genomic, are then screened with probes designed to identify the gene of 
interest or the protein encoded by it. For cDNA expression libraries, suitable probes include monoclonal and 
polyclonal antibodies that recognize and specifically bind to a PSTPIP polypeptide. For cDNA libraries, 
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suitable probes include carefully selected oligonucleotide probes (usually of about 20-80 bases in length) that 
encode known or suspected portions of a PSTPIP polypeptide from the same or different species, and/or 
complementary or homologous cDNAs or fragments" thereof that encode the same or a similar gene. 
Appropriate probes for screening genomic DNA libraries include, without limitation, oligonucleotides, 
5 cDNAs, or fragments thereof that encode the same or a similar gene, and/or homologous genomic DNAs or 
fragments thereof. Screening the cDNA or genomic library with the selected probe may be conducted using 
standard procedures as described in Chapters 10-12 of Sambrook et aL Molecular Cloning: A Laboratory' 
Manual, New York, Cold Spring Harbor Laboratory Press, 1989. 

If DNA encoding a polypeptide of the present invention is isolated by using carefully selected 

10 oligonucleotide sequences to screen cDNA libraries from various tissues, the oligonucleotide sequences 
selected as probes should be sufficient in length and sufficiently unambiguous that false positives are 
minimized. The actual nucleotide sequence(s) is/are usually designed based on regions which have the least 
codon redundance. The oligonucleotides may be degenerate at one or more positions. The use of degenerate 
oligonucleotides is of particular importance where a library is screened from a species in which preferential 

15 codon usage is not known. 

The oligonucleotide must be labeled such that it can be detected upon hybridization to DNA in the 
library being screened. The preferred method of labeling is to use ATP (e.g., y^ 2 P) and polynucleotide kinase 
to radiolabel the 5* end of the oligonucleotide. However, other methods may be used to label the 
oligonucleotide, including, but not limited to, biotinylation or enzyme labeling. 

20 cDNAs encoding PSTPIP polypeptides can also be identified and isolated by other known 

techniques of recombinant DNA technology, such as by direct expression cloning, or by using the polymerase 
chain reaction (PCR) as described in U.S. Patent No. 4,683,195, issued 28 July 1987, in section 14 of 
Sambrook et al., supra , or in Chapter 15 of Current Protocols in Molecular Biology, Ausubel et al. eds., 
Greene Publishing Associates and Wiley-interscience (1991). 

25 Once cDN A encoding a PSTPIP polypeptide from one species has been isolated. cDNAs from other 

species can also be obtained by cross-species hybridization. According to this approach, human or other 
mammalian cDNA or genomic libraries are probed by labeled oligonucleotide sequences selected from known 
PSTPIP sequences (such as murine PSTPIP) in accord with known criteria, among which is that the sequence 
should be sufficient in length and sufficiently unambiguous that false positives are minimized. Typically, a 

30 32 P-labeled oligonucleotide having about 30 to 50 bases is sufficient, particularly if the oligonucleotide 
contains one or more codons for methionine or tryptophan. Isolated nucleic acid will be DNA that is identified 
and separated from contaminant nucleic acid encoding other polypeptides from the source of nucleic acid. 
Hybridization is preferably performed under "stringent conditions", as herein above defined. 

Once the sequence is known, the gene encoding a particular PSTPIP polypeptide can also be 

35 obtained by chemical synthesis, following one of the methods described in Engels and Uhimann, Agnew. 
Chem. Int. Ed EngL 28:716 (1989). These methods include triester, phosphite, phosphoramidite and H- 
phosphonate methods, PCR and other autoprimer methods, and oligonucleotide syntheses on solid supports. 
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2. CI ning and Expression of Nucleic Acid Encoding PSTPIP 

Once the nucleic acid encoding PSTPIP is available, it is generally ligated into a replicable 
expression vector for further cloning (amplification of the DNA), or for expression. 

Expression and cloning vectors are well known in the art and contain a nucleic acid sequence that 

5 enables the vector to replicate in one or more selected host cells. The selection of the appropriate vector will 
depend on 1) whether it is to be used for DNA amplification or for DNA expression, 2) the size of the DNA 
to be inserted into the vector, and 3) the host cell to be transformed with the vector. Each vector contains 
various components depending on its function (amplification of DNA of expression of DNA) and the host cell 
for which it is compatible. The vector components generally include, but are not limited to, one or more of 

10 the following: a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a 
promoter, and a transcription termination sequence. Construction of suitable vectors containing one or more 
of the above listed components, the desired coding and control sequences, employs standard ligation 
techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in the form desired to 
generate the plasmids required. For analysis to confirm correct sequences in plasmids constructed, the ligation 

15 mixtures are commonly used to transform E. colt cells, e.g. E. coli K12 strain 294 (ATCC 31,446) and 
successful transforrnants selected by ampicillin or tetracycline resistance where appropriate. Plasmids from 
the transforrnants are prepared, analyzed by restriction endonuclease digestion, and/or sequenced by the 
method of Messing et al.. Nucleic Acids Res. 9:309 (1981) or by the method of Maxam et al., Methods in 
Enzymology 65:499 (1980). 

20 The polypeptides of the present invention may be expressed in a variety of prokaryotic and 

eukaryotic host cells. Suitable prokaryotes include gram negative or gram positive organisms, for example 
E. coli or bacilli. A preferred cloning host is E. coJi 294 (ATCC 3 1 ,446) although other gram negative or gram 
positive prokaryotes such as E. coh B, E. coh X1776 (ATCC 31,537), E. coH W3110 (ATCC 27,325), 
Pseudomonas species, or Serratia Marcesans are suitable. 

25 In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts 

for vectors herein. Saccharomvces cerevisiae . or common baker's yeast, is the most commonly used among 
lower eukaryotic host microorganisms. However, a number of other genera, species and strains are commonly 
available and useful herein, such as S. pombe (Beach and Nurse, Nature 290:140 (1981)), Kluyveromyces 
lactis (Louvencourt et al., J. BacterioL 731 (1983)); varrowia (EP 402,226); Pichia pastoris (EP 183,070), 

30 Trichoderma reesia (EP 244,234), Neurospora crassa (Case et al., Proc. Natl. Acad. ScL USA 76:5259-5263 
(1979)) and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Biophys. Res. Commun. 1 12:284- 
289 (1983); Tilburn et al., Gene 26:205-221 (1983); Yelton et al., Proc. Natl Acad. ScL USA 81: 1470-1474 
(1984)) and A. nieer (Kelly and Hynes, EM BO J. 4:475-479 (1985)). 

Suitable host cells may also derive from multicellular organisms. Such host cells are capable of 

35 complex processing and glycosylation activities. In principle, any higher eukaryotic cell culture is workable, 
whether from vertebrate or invertebrate culture, although cells from mammals such as humans are preferred. 
Examples of invertebrate cells include plants and insect cells. Numerous baculoviral strains and variants and 
corresponding permissive insect host cells from hosts such as Spodoptera frugiperda (caterpillar), Aedes 
aegypti (mosquito), Aedes albooictus (mosquito), Drosophila melanoeaster (fruitfly), and Bombvx mori host 
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cells have been identified. See, e.g. Luckow et al., Bio/Technology 6:47-55 (1988); Miller et al., in Genetic 
Engineering, Setlow, J.K. et al., eds., Vol. 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda et al., Nature 
3 15:592-594 (1 985). A variety of such viral strains are publicly available, e.g. the L- 1 variant of Autographa 
californica NPV, and such viruses may be used as the virus herein according to the present invention, 
5 particularly for transfection of Spodoptera frueiperda cells. 

Plant cell cultures of cotton, com, potato, soybean, petunia, tomato, and tobacco can be utilized as 
hosts. Typically, plant cells are transfected by incubation with certain strains of the bacterium Aerobacterium 
tumefaciens . which has been previously manipulated to contain the PSTPIP DNA. During incubation of the 
plant cell culture with A. tumefaciens . the DNA encoding a PSTPIP polypeptide is transferred to the plant cell 

1 0 host such that it is transfected, and will, under appropriate conditions, express the PSTPIP DNA. In addition, 
regulatory and signal sequences compatible with plant cells are available, such as the nopaline synthase 
promoter and polyadenylation signal sequences. Depicker et al., J. Mol Appl Gen. 1:561 (1982). In addition, 
DNA segments isolated from the upstream region of the T-DNA 780 gene are capable of activating or 
increasing transcription levels of plant-expressible genes in recombinant DNA-containing plant tissue. See 

15 EP 321,196 published 21 June 1989. 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) is per se well known. See Tissue Culture. Academic Press, Kruse and Patterson, editors (1973). 
Examples of useful mammalian host cell lines are monkey kidney CV1 line transformed by SV40 (COS-7, 
ATCC CRL 1651); human embryonic kidney cell line (293 or 293 cells subcloned for growth in suspension 

20 culture, Graham et al., J. Gen Virol 36:59 (1977)); baby hamster kidney cells 9BHK, (ATCC CCL 10); 
Chinese hamster ovary cells/-DHFR [CHO, Urlaub and Chasin, Proc. NatL Acad. Sci. USA 77:4216 (1980)]; 
mouse sertolli cells [TM4, Mather, Biol Reprod. 23:243-251 (1980)]; monkey kidney cells (CV1 ATCC CCL 
70); African green monkey kidney cells (VERO-76, ATCC CRL- 1587); human cervical carcinoma cells 
(HELA. ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC 

25 CRL 1442); human lung cells (W138, ATCC CCL75); human liver cells (Hep G2, HB 8065); mouse 
mammary tumor (MMT 060562, ATCC CCL5 1 ); TRI ceils [Mather et al.. Annals N. Y. Acad. Sci. 383:44068 
(1982)]; MRC 5 cells; FS4 cells; and a human hepatoma cell line (Hep G2). Preferred host cells are human 
embryonic kidney 293 and Chinese hamster ovary cells. 

Particularly useful in the practice of this invention are expression vectors that provide for the 

30 transient expression in mammalian cells of DNA encoding a PSTPIP polypeptide. In general, transient 
expression involves the use of an expression vector that is able to replicate efficiently in a host cell, such that 
the host cell accumulates many copies of the expression vector and, in turn, synthesizes high levels of a desired 
polypeptide encoded by the expression vector. Transient systems, comprising a suitable expression vector and 
a host cell, allow for the convenient positive identification of polypeptides encoded by clones DNAs, as well 

35 as for the rapid screening of such polypeptides for desired biological or physiological properties. Thus, 
transient expression systems are particularly useful in the invention for purposes of identifying analogs and 
variants of a PSTPIP polypeptide. 

Other methods, vectors, and host cells suitable for adaptation to the synthesis of the PSTPIP 
polypeptides in recombinant vertebrate cell culture are described in Getting et al., Nature 293:620-625 (1981); 
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Mantel et al., Nature 281:40-46 (1979); Levinson et al.; EP 1 17,060 and EP 1 17,058. Particularly useful 
plasmids for mammalian cell culture expression of the PSTP1P polypeptides are pRK5 (EP 307,247) or 
pSV16B (PCT Publication No. WO 91/08291). 

Other cloning and expression vectors suitable for the expression of the PSTPIP polypeptides of the 
5 present invention in a variety of host cells are. for example, described in EP 457,758 published 27 November 
1 991 . A large variety of expression vectors are now commercially available. An exemplary commercial yeast 
expression vector is pPJC.9 (Invitrogen), while an commercially available expression vector suitable for 
transformation of E. coli cells is PET15b (Novagen). 

C. CuKuring the Host Cells 

10 Prokaryote cells used to produced the PSTPIP polypeptides of this invention are cultured in suitable 

media as describe generally in Sambrook et al., supra . 

Mammalian cells can be cultured in a variety of media. Commercially available media such as 
Ham's F10 (Sigma), Minimal Essential Medium (MEM, Sigma), RPMI-1640 (Sigma), and Dulbecco's 
Modified Eagle's Medium (DMEM, Sigma) are suitable for culturing the host cells. In addition, any of the 

15 media described in Ham and Wallace, Meth. Enzymol. 58:44 (1979); Barnes and Sato, Anal. Biochem. 102:255 
(1980), US 4.767,704; 4,657,866; 4,927,762; or 4,560,655; WO 90/03430; WO 87/00195 or US Pat. Re. 
30,985 may be used as culture media for the host cells. Any of these media may be supplemented as necessary 
with hormones and/or other growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such 
as sodium chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides (such as 

20 adenosine and thymidine), antibiotics (such as Gentamycin™ drug) trace elements (defined as inorganic 
compounds usually present at final concentrations in the micromolar range), and glucose or an equivalent 
energy source. Any other necessary supplements may also be included at appropriate concentrations that 
would be known to those skilled in the art. The culture conditions, such as temperature, pH and the like, 
suitably are those previously used with the host cell selected for cloning or expression, as the case may be, and 

25 will be apparent to the ordinary artisan. 

The host cells referred to in this disclosure encompass cells in in vitro cell culture as well as cells 
that are within a host animal or plant. 

It is further envisioned that the PSTPIP polypeptides of this invention may be produced by 
homologous recombination, or with recombinant production methods utilizing control elements introduced 

30 into cells already containing DNA encoding the particular PSTPIP polypeptide. 

D. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. 
Nail. Acad ScL USA 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
35 appropriately labeled probe, based on the sequences provided herein. Various labels may be employed, most 
commonly radioisotopes, particularly JZ P. However, other techniques may also be employed, such as using 
biotin-modified nucleotides for introduction into a polynucleotide. The biotin then serves as a site for binding 
to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluoresces, 
enzymes, or the like. Alternatively, antibodies may be employed that can recognize specific duplexes, 
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including DNA duplexes, RNA duplexes, and DNA-RN A hybrid duplexes or DNA-protein duplexes. The 
antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to the surface, 
so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be 
detected. 

5 Gene expression, alternatively, may be measured by immunological methods, such as 

immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly 
the expression of gene product. With immunohistochemical staining techniques, a cell sample is prepared, 
typically by dehydration and fixation, followed by reaction with labeled antibodies specific for the gene 
product coupled, where the labels are usually visually detectable, such as enzymatic labels, fluorescent labels, 

10 luminescent labels, and the like. A particularly sensitive staining technique suitable for use in the present 
invention is described by Hse et al., Am. J. Clin. Pharm. 75:734-738 (1980). 

Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either 
monoclonal or polyclonal, and may be prepared in any animal. Conveniently, the antibodies may be prepared 
against a native PSTPIP polypeptide, or against a synthetic peptide based on the DNA sequence provided 

15 herein as described further hereinbelow. 

E. Amino Acid Sequence Variants of Native PSTPIP Polypeptides 

Amino acid sequence variants of native PSTPIP polypeptides are prepared by methods known in 
the art by introducing appropriate nucleotide changes into a PSTPIP DNA, or by in vitro synthesis of the 
desired polypeptide. There are two principal variables in the construction of amino acid sequence variants: 

20 the location of the mutation site and the nature of the mutation. With the exception of naturally-occurring 
alleles, which do not require the manipulation of the DNA sequence encoding the PSTPIP. the amino acid 
sequence variants of PSTPIP polypeptides are preferably constructed by mutating the DNA, either to arrive 
at an allele or an amino acid sequence variant that does not occur in nature. 

One group of the mutations will be created within the N-terminai coiled coil region of the 

25 polypeptides of the present invention. Non-conservative substitutions within this region may result in PSTPIP 
variants which lose their ability to be bound and/or dephosphorylated by PTP HSCF (or any other PEST PTP). 
PSTPIP variants mutated to alter their ability to associate with actin will be useful, for example, as inducers 
or inhibitors of cytokinesis. 

In the murine PSTPIP sequence (SEQ ID NO: 1 ) the coiled-coil domain is defined as extending from 

30 about amino acid position 30 to about amino acid position 261 (see Figure 1 A). However, in a broader sense, 
the coiled-coil domain may be viewed as starting at the N-terminus of the PSTPIP protein. Mutational analysis 
revealed that the six cysteine residues present within this region are not critical for the correct folding and 
function of PSTPIP. Unexpectedly, the tryptophan (W) residue at amino acid position 232 of the murine 
sequence was found to be critical for binding PTP HSCF. Mutation of this tryptophan residue to alanine 

35 resulted in a complete loss of binding. Accordingly, in order to retain biological activity, the tryptophan at 
position 232 must be retained, although substitution by other aromatic amino acids, e.g. tyrosine and 
phenylalanine might result in variants which retain their ability to bind PTP HSCF to some extent. 
Conversely, if variants that do not bind PTP HSCF are required, the tryptophan residue at position 232 of the 
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murine sequence, and corresponding residues in the PSTPIP proteins from other mammalian species, including 
human, are a primary target for substitution. 

While the tryptophan residue at position 232 of SEQ. ID. NO: 1 plays a critical role in PTP HSCF 
binding, the tryptophan at position 205, the phenylalanine at position 22 1 and leucine at position 224 are not 
5 critical, and can be readily mutated. 

Alternatively, or in addition, amino acid alterations can be made at sites that differ in PSTPIP 
proteins from various species, or in highly conserved regions, depending on the goal to be achieved. Sites at 
such locations will typically be modified in series, e.g. by (I) substituting first with conservative choices and 
then with more radical selections depending upon the results achieved, (2) deleting the target residue or 
10 residues, or (3) inserting residues of the same or different class adjacent to the located site, or combinations 
of options 1-3. One helpful technique is called "alanine scanning" (Cunningham and Wells, Science 244 . 
1081-1085 [1989]). 

Naturally-occurring amino acids are divided into groups based on common side chain properties: 
(1) hydrophobic: norleucine, met, ala ? val, leu, ile: 
15 (2) neutral hydrophobic: cys, ser, thr: 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

20 Conservative substitutions involve exchanging a member within one group for another member 

within the same group, whereas non-conservative substitutions will entail exchanging a member of one of 
these classes for another. Substantial changes in function or immunological identity are made by selecting 
substitutions that are less conservative, i.e. differ more significantly in their effect on maintaining (a) the 
structure of the polypeptide backbone in the area of substitution, for example as a sheet or helical 

25 conformation, (b) the charge or hydrophobiciry of the molecule at the target site or (c) the bulk of the side 
chain. The substitutions which in general are expected to produce the greatest changes in the properties of the 
novel native PSTPIP polypeptides of the present invention will be those in which (a) a hydrophilic residue, 
e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, 
valyl or alanyi; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 

30 electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, 
e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or 
by) one not having a side chain, e.g. glycine. 

Amino acid sequence deletions generally range from about I to 30 residues, more preferably about 
1 to 10 residues, and typically are contiguous. 

35 The results discosed in the examples show that the N-terminus of PSTPIP is required for the 

formation of a correctly folded protein that is capable of binding PTPHSCF. Accordingly, if structural 
integrity and biological activity are to be retained, any N-terminal deletion should not extend beyond about 
amino acid 25 of the murine PSTPIP sequence or the corresponding amino acid in the human or other 
mammalian sequences. The presence of the C-terminai portion of the PSTPIP proteins is less critical. The 
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coiled-coil domain is sufficient for proper folding of the protein, as attested by data showing that transfection 
of the coiled-coil domain of PSTPIP results in co-localization of the protein with the cortical actin cytoskeleton 
and the lamcllipodia, an event which presumably requires a correctly folded protein. As noted before, and is 
shown in Figure 1A, in the murine PSTPIP sequence (SEQ ID NO:l) the coiled-coil domain is defined as 
5 extending from about amino acid position 30 to about amino acid position 261. Similar domains can be 
readily identified in PSTPIPs from other mammalian species, e.g. humans. 

Amino acid insertions include amino- and/or carboxyi-terminal fusions ranging in length from one 
residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions of single 
or multiple amino acid residues. Intrasequence insertions (i.e. insertions within the PSTPIP protein amino acid 

10 sequence) may range generally from about 1 to 10 residues, more preferably 1 to 5 residues, more preferably 
1 to 3 residues. Examples of terminal insertions include the PSTPIP polypeptides with an N-terminal 
methionyl residue, an artifact of its direct expression in bacterial recombinant cell culture, and fusion of a 
heterologous N-terminal signal sequence to the N-terminus of the PSTPIP molecule to facilitate the secretion 
of the mature PSTPIP from recombinant host cells. Such signal sequences will generally be obtained from. 

15 and thus homologous to, the intended host cell species. Suitable sequences include STII or Ipp for E. coli . 
alpha factor for yeast, and viral signals such as herpes gD for mammalian cells. 

Other insertional variants of the native PSTPIP molecules include the fusion of the N- or C- 
terminus of the PSTPIP molecule to immunogenic polypeptides, e.g. bacterial polypeptides such as beta- 
lactamase or an enzyme encoded by the E. coli trp locus, or yeast protein, and C-terminal fusions with proteins 

20 having a long half-life such as immunoglobulin regions (preferably immunoglobulin constant regions), 
albumin, or ferritin, as described in WO 89/02922 published on 6 April 1989. 

Further insertional variants are immunologically active derivatives of the novel PSTPIP 
polypeptides, which comprise the PSTPIP polypeptide and a polypeptide containing an epitope of an 
immunologically competent extraneous polypeptide, i.e. a polypeptide which is capable of eliciting an immune 

25 response in the animal to which the fusion is to be administered or which is capable of being bound by an 
antibody raised against an extraneous polypeptide. Typical examples of such immunologically competent 
polypeptides are allergens, autoimmune epitopes, or other potent immunogens or antigens recognized by pre- 
existing antibodies in the fusion recipient, including bacterial polypeptides such as trpLE, p-galactosidase, 
viral polypeptides such as herpes gD protein, and the like. 

30 Immunogenic fusions are produced by cross-linking in vitro or by recombinant cell culture 

transformed with DNA encoding an immunogenic polypeptide. It is preferable that the immunogenic fusion 
be one in which the immunogenic sequence is joined to or inserted into a novel PSTPIP molecule or fragment 
thereof by (a) peptide bond(s). These products therefore consist of a linear polypeptide chain containing the 
PSTPIP epitope and at least one epitope foreign to the PSTPIP polypeptide. It will be understood that it is 

35 within the scope of this invention to introduce the epitopes anywhere within a PSTPIP molecule of the present 
invention or a fragment thereof. These immunogenic insertions are particularly useful when formulated into 
a pharmacologically acceptable carrier and administered to a subject in order to raise antibodies against the 
PSTPIP molecule, which antibodies in turn are useful as diagnostics, in tissue-typing, or in purification of the 
novel PSTPIP polypeptides by immunoaffinity techniques known perse. Alternatively, in the purification of 
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the PSTPIP polypeptides of the present invention, binding partners for the fused extraneous polypeptide, e.g. 
antibodies, receptors or ligands, are used to adsorb the fusion from impure admixtures, after which the fusion 
is eluted and, if desired, the novel PSTPIP is recovered from the fusion, e.g. by enzymatic cleavage. 

After identifying the desired mutation(s), the gene encoding a PSTPIP variant can, for example, 
5 be obtained by chemical synthesis using well known techniques. More preferably, DNA encoding a PSTPIP 
amino acid sequence variant is prepared by site-directed mutagenesis of DNA that encodes an earlier prepared 
variant or a non variant version of the PSTPIP. Site-directed (site-specific) mutagenesis allows the production 
of PSTPIP variants through the use of specific oligonucleotide sequences that encode the DNA sequence of 
the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of 

10 sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being 
traversed. Typically, a primer of about 20 to 25 nucleotides in length is preferred, with about 5 to 10 residues 
on both sides of the junction of the sequence being altered. In general, the techniques of site-specific 
mutagenesis are well known in the art, as exemplified by publications such as Edelman et al., DNA 2:183 
(1983). As will be appreciated, the site-specific mutagenesis technique typically employs a phage vector that 

15 exists in both a single-stranded and double-stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the M13 phage, for example, as disclosed by Messing et al.. Third Cleveland 
Symposium on Macromolecules and Recombinant DNA. A. Walton, ed., Elsevier, Amsterdam (1981). This 
and other phage vectors are commercially available and their use is well known to those skilled in the art, 
A versatile and efficient procedure for the construction of oligodeoxyribonucleotide directed site-specific 

20 mutations in DNA fragments using M13-derived vectors was published by Zoller et al., Nucleic Acids Res. 
10:6487-6500 [1982]). Also, plasmid vectors that contain a single-stranded phage origin of replication (Veira 
et al., Meth. Enzymol. 153:3 [1987]) may be employed to obtain single-stranded DNA. Alternatively, 
nucleotide substitutions are introduced by synthesizing the appropriate DNA fragment in vitro, and amplifying 
it by PCR procedures known in the art. 

25 The PCR technique may also be used in creating amino acid sequence variants of a PSTPIP 

polypeptide. In a specific example of PCR mutagenesis, template plasmid DNA (1 ug) is linearized by 
digestion with a restriction endonuclease that has a unique recognition site in the plasmid DNA outside of the 
region to be amplified. Of this material, 100 ng is added to a PCR mixture containing PCR buffer, which 
contains the four deoxynucleotide triphosphates and is included in the GENEAMP R kits (obtained from 

30 Perkin-Elmer Cetus, Norwalk, CT and Emeryville, CA), and 25 pmoie of each oligonucleotide primer, to a 
final volume of 50 ul The reaction mixture is overlayered with 35 nl mineral oil. The reaction is denatured 
for 5 minutes at 100°C, placed briefly on ice, and then 1 u.1 Thermus aquaticus (Tag,) DNA polymerase (5 
units/u.1), purchased from Perkin-Elmer Cetus, Norwalk, CT and Emeryville, CA) is added below the mineral 
oil layer. The reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin-Elmer 

35 Cetus) programmed as follows: 
2 min. 55oC, 

30 sec. 72°C, then 19 cycles of the following: 
30 sec. 94oC, 
30 sec. 55°C, and 
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30 sec. 72oC. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous 
phase transferred to a new vial, extracted with phenol/chloroform (50:50 vol), and ethanol precipitated, and 
the DNA is recovered by standard procedures. This material is subsequently subjected to appropriate 
5 treatments for insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described 
by Wells et al., Gene 34:315 (1985). 

Additionally, the so-called phagemid display method may be useful in making amino acid sequence 
variants of native or variant PSTPIP polypeptides or their fragments. This method involves (a) constructing 
10 a replicable expression vector comprising a first gene encoding a receptor to be mutated, a second gene 
encoding at least a portion of a natural or wild-type phage coat protein wherein the first and second genes are 
heterologous, and a transcription regulatory element operably linked to the first and second genes, thereby 
forming a gene fusion encoding a fusion protein; (b) mutating the vector at one or more selected positions 
within the first gene thereby forming a family of related plasmids; (c) transforming suitable host cells with the 
15 plasmids; (d) infecting the transformed host cells with a helper phage having a gene encoding the phage coat 
protein; (e) culturing the transformed infected host cells under conditions suitable for forming recombinant 
phagemid particles containing at least a portion of the plasmid and capable of transforming the host, the 
conditions adjusted so that no more than a minor amount of phagemid particles display more than one copy 
of the fusion protein on the surface of the particle; (0 contacting the phagemid particles with a suitable antigen 
20 so that at least a portion of the phagemid particles bind to the antigen; and (g) separating the phagemid 
panicles that bind from those that do not. Steps (d) through (g) can be repeated one or more times. Preferably 
in this method the plasmid is under tight control of the transcription regulatory element, and the culturing 
conditions are adjusted so that the amount or number of phagemid particles displaying more than one copy 
of the fusion protein on the surface of the particle is less than about 1 %. Also, preferably, the amount of 
25 phagemid particles displaying more than one copy of the fusion protein is less than 10% of the amount of 
phagemid particles displaying a single copy of the fusion protein. Most preferably, the amount is less than 
20%. Typically in this method, the expression vector will further contain a secretory signal sequence fused 
to the DNA encoding each subunit of the polypeptide and the transcription regulatory element will be a 
promoter system. Preferred promoter systems are selected from ]ac Z, X PL , tac, T7 polymerase, tryptophan, 
30 and alkaline phosphatase promoters and combinations thereof. Also, normally the method will employ a 
helper phage selected from M13K07, M 13R408, M13-VCS, and Phi X 174. The preferred helper phage is 
M13K07, and the preferred coat protein is the Ml 3 Phage gene III coat protein. The preferred host is E. co/i % 
and protease-deficient strains of E. coli. 

Since it is often difficult to predict in advance the characteristics of a variant PSTPIP, it will be 
35 appreciated that some screening will be needed to select the optimum variant. 

Further details of the foregoing and similar mutagenesis techniques are found in general textbooks, 
such as, for example, Sambrook et al., supra, and Current Protocols in Molecular Biology, Ausubel et al. eds., 
supra. 
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F. Glvcosvlat ion Variants 

Glycosylation variants are included within the scope of the present invention. They include variants 
completely lacking in glycosylation (unglycosylated), variants having at least one less glycosylated site than 
the native form (deglycosylated) as well as variants in which the glycosylation has been changed. Included 
5 are deglycosylated and unglycosylated amino acid sequences variants, deglycosylated and unglycosylated 
native PSTPIP, and other glycosylation variants. For example, substitutional or deletional mutagenesis may 
be employed to eliminate the N- or O-linked glycosylation sites in the a native or variant PSTPIP molecule 
of the present invention, e.g. the asparagine residue may be deleted or substituted for another basic residue 
such as lysine or histidine. Alternatively, flanking residues making up the glycosylation site may be 

10 substituted or deleted, even though the asparagine residues remain unchanged, in order to prevent 
glycosylation by eliminating the glycosylation recognition site. 

Additionally, unglycosylated PSTPIP polypeptides which have the glycosylation sites of a native 
molecule may be produced in recombinant prokaryotic cell culture because prokaryotes are incapable of 
introducing glycosylation into polypeptides. 

1 5 Glycosylation variants may be produced by selecting appropriate host cells or by in vitro methods. 

Yeast and insect cells, for example, introduce glycosylation which varies significantly from that of mammalian 
systems. Similarly, mammalian cells having a different species (e.g. hamster, murine, porcine, bovine or 
ovine), or tissue origin (e.g. lung, liver, lymphoid, mesenchymal or epidermal) than the source of the PSTPIP 
polypeptide are routinely screened for the ability to introduce variant glycosylation as characterized for 

20 example by elevated levels of mannose or variant ratios of mannose, fucose, sialic acid, and other sugars 
typically found in mammalian glycoproteins. In vitro processing of the PSTPIP typically is accomplished by 
enzymatic hydrolysis, e.g. neuraminidase digestion. 

G. Covalent Modifications of PSTPIP Polypeptides 

Covalent modifications of PSTPIP polypeptides are included within the scope herein. Such 
25 modifications are traditionally introduced by reacting targeted amino acid residues of the PSTPIP polypeptides 
with an organic derivatizing agent that is capable of reacting with selected sites or terminal residues, or by 
harnessing mechanisms of post-translational modifications that function in selected recombinant host cells. 
The resultant covalent derivatives are useful in programs directed at identifying residues important for 
biological activity, for immunoassays of the PSTPIP, or for the preparation of anti-PSTPIP antibodies for 
30 immunoaffinity purification of the recombinant. For example, complete inactivation of the biological activity 
of the protein after reaction with ninhydrin would suggest that at least one arginyl or lysyl residue is critical 
for its activity, whereafter the individual residues which were modified under the conditions selected are 
identified by isolation of a peptide fragment containing the modified amino acid residue. Such modifications 
are within the ordinary skill in the art and are performed without undue experimentation. 
35 Cysteinyl residues most commonly are reacted with ot-haloacetates (and corresponding amines), 

such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. 
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Cysteinyi residues also are derivatized by reaction with bromotrifluoroacetone, oc-bromo-p-(5- 
imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2- 
pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3- 
diazole. 

5 Histidyi residues are derivatized by reaction with diethyipyrocarbonate at pH 5.5-7.0 because this 

agent is relatively specific for the histidyi side chain. Para-bromophenacyl bromide also is useful; the reaction 
is preferably performed in 0. 1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. 
Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable 
10 reagents for derivatizing oc-amino-containing residues include imidoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2,4- 
pentanedione; and transarninase-catalyzed reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, among them 
phenylglyoxal, 2,3-butanedione. 1.2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues 
15 requires that the reaction be performed in alkaline conditions because of the high pK a of the guanidine 
functional group. Furthermore, these reagents may react with the groups of lysine as well as the arginine 
epsilon-amino group. 

The specific modification of tyrosyl residues may be made, with particular interest in introducing 
spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromethane. 

20 Most commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyi tyrosyl species and 3- 
nitro derivatives, respectively. Tyrosyl residues are iodinated using l25 l or 13 1 1 to prepare labeled proteins 
for use in radioimmunoassay. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides 
(R'-N^ON-R*) such as l-cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or 1 -ethyl-3-(4-azonia-4,4- 

25 dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and 
aspartyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Either form 
of these residues falls within the scope of this invention. 

30 Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl 

groups of seryl, threonyl or tyrosyl residues, methylation of the a -amino groups of lysine, arginine, and 
histidine side chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., 
San Francisco, pp. 79-86 [1983]), acetylation of the N-terminal amine, and amidation of any C-terminal 
carboxyl group. The molecules may further be covalently linked to nonproteinaceous polymers, e.g. 

35 polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. patents 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Derivatization with bifunctional agents is useful for preparing intramolecular aggregates of the 
PSTP1P polypeptides with polypeptides as well as for cross-linking the PSTPIP polypeptide to a water 
insoluble support matrix or surface for use in assays or affinity purification. In addition, a study of interchain 
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cross-links will provide direct information on conformational structure. Commonly used cross-Unking agents 
include l,l-bis(diazoacetyl)-2-phenyIethane, glutaraldehyde, N-hydroxysuccinimide esters, homobifunctional 
imidoesters, and bifunctional maleimides. Derivatizing agents such as rnethyl-3-[(p- 
azidophenyl)dithio]propioimidate yield photoactivatable intermediates which are capable of forming cross- 
5 links in the presence of light. Alternatively, reactive water insoluble matrices such as cyanogen bromide 
activated carbohydrates and the systems reactive substrates described in U.S. Patent Nos. 3,959,642; 
3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; 4,055,635; and 4,330,440 are employed for protein 
immobilization and cross-linking. 

Certain post-trans lational modifications are the result of the action of recombinant host cells on the 

10 expressed polypeptide. Glutaminyl and aspariginyl residues are frequently post-translationally deamidated 
to the corresponding glutamyl and aspartyl residues. Alternatively, these residues are deamidated under mildly 
acidic conditions. Either form of these residues falls within the scope of this invention. 

Other derivatives comprise the novel peptides of this invention covalently bonded to a 
nonproteinaceous polymer. The nonproteinaceous polymer ordinarily is a hydrophilic synthetic polymer, i.e. 

15 a polymer not otherwise found in nature. However, polymers which exist in nature and are produced by 
recombinant or in vitro methods are useful, as are polymers which are isolated from nature. Hydrophilic 
polyvinyl polymers fall within the scope of this invention, e.g. poiyvinyialcohol and polyvinylpyrrolidone. 
Particularly useful are polyvinylalkylene ethers such a polyethylene glycol, polypropylene glycol. 

The PSTPIP polypeptides may be linked to various nonproteinaceous polymers, such as 

20 polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

The PSTPIP polypeptides may be entrapped in microcapsules prepared, for example, by 
coacervation techniques or by interfacial polymerization, in colloidal drug delivery systems (e.g. liposomes, 
albumin microspheres, microemulsions. nano-particles and nanocapsules), or in macroernulsions. Such 

25 techniques are disclosed in Remington's Pharmaceutical Sciences . 16th Edition, Oslo, A., Ed. 

Further derivatives of the PSTPIP polypeptides herein are the so-called "immunoadhesins'\ To 
date, more than fifty immunoadhesins have been reported in the art. Immunoadhesins reported in the literature 
include, for example, fusions of the T cell receptor (Gascoigne et ah, Proc. NatL Acad. ScL USA 84:2936-2940 
[1987]); CD4 (Capon et al., Nature 337:525-531 [1989]; Traunecker et al., Nature 339:68-70 [1989]; 

30 ZettmeissI et al., DNA Cell Biol. USA 9:347-353 [1990]; Byrn et al., Nature 344:667-670 [1990]); L-selectin 
(homing receptor) (Watson et al., J. Cell. Biol 1 10:2221-2229 [1990]; Watson et al., Nature 349:164-167 
[1991]); E-selectin [Mulligan et al., J. Immunol. 151:6410-17 [1993]; Jacob et al., Biochemistry 34:1210-1217 
[1995]); P-selectin (Mulligan et al., supra : Hollenbaugh et aL, Biochemistry 34:5678-84 [1995]); ICAM-1 
(Stauton etaLJ. Exp. Med. 176: 1471-1476 [1992]; Martin et aL, J. Virol. 67:3561-68 [1993]; Roep et al., 

35 Lancet 343: 1590-93 [1994]); ICAM-2 (Damle et al., J. Immunol. 148:665-71 [1992]); ICAM-3 (Holness et 
al., J. Biol. Chenu 270:877-84 [1995]); LFA-3 (Kanner et aL, J. Immunol. 148:223-229 [1992]); LI 
glycoprotein (Doherty et aL, Neuron 14:57-66 [1995]); TNF-R1 (Ashkenazi et aL, Proc. Natl. Acad. ScL USA 
88:10535-539 [1991]; Lesslauer et aL, Eur. J. Immunol. 21:2883-86 [1991]; Peppel et aL, J. Exp. Med. 
174:1483-1489 [1991]); TNF-R2 (Zack et aL, Proc. NatL Acad. ScL USA 90:2335-39 [1993]; Wooiey et aL, 
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J. Immunol 151:6602-07 [1993]); CD44 [Aruffo et a!., £ell 61 : 1303-1 3 13 (1990)]; CD28 and B7 [Linsley 
et al., J. Exp. Med. 173:721-730 (1991)]; CTLA-4 [Lisley et al., J. Exp. Med. 174:561-569 (1991)]; CD22 
[Stamenkovic et al„ Cell 66:1 133-1 144 (1991)]; NP receptors [Bennett et al., J. Biol. Chem. 266:23060-23067 
(1991)]; IgE receptor a [Ridgway and Gorman. J. Cell. Biol. 115,abstr. 1448(1991)]; HGF receptor [Mark, 
5 M.R. et al., 1992, J. Biol. Chem. submitted]; IFN-yR a- and p-chain [Marsters et al., Proc. Natl. Acad. Sci. 
USA 92:5401-05 [1995]); trk-A, -B, and -C (Shelton et al., J. Neurosci 15:477-91 [1995]); IL-2 (Landolfi, 
J. Immunol. 146:915-19 [1991]); IL-10 (Zheng et al., J. Immunol. 154:5590-5600 [1995]). 

The simplest and most straightforward immunoadhesin design combines the binding region(s) of 
the 'adhesin* protein with the hinge and Fc regions of an immunoglobulin heavy chain. Ordinarily, when 
0 preparing the PSTPIP-immunoglobulin chimeras of the present invention, nucleic acid encoding the desired 
PSTPIP polypeptide will be fused C-terminally to nucleic acid encoding the N-terminus of an immunoglobulin 
constant domain sequence, however N-terminal fusions are also possible. Typically, in such fusions the 
encoded chimeric polypeptide will retain at least functionally active hinge, CH2 and CH3 domains of the 
constant region of an immunoglobulin heavy chain. Fusions are also made to the C-terminus of the Fc portion 
5 of a constant domain, or immediately N-terminal to the CHI of the heavy chain or the corresponding region 
of the light chain. The precise site at which the fusion is made is not critical; particular sites are well known 
and may be selected in order to optimize the biological activity, secretion or binding characteristics of the 
PSTPIP-immunoglobulin chimeras. 

In a preferred embodiment, the sequence of a native, mature PSTPIP polypeptide, or variant or 
20 fragment thereof, is fused to the N-terminus of the C-terminal portion of an antibody (in particular the Fc 
domain), containing the effector functions of an immunoglobulin, e.g. lgG-1. It is possible to fuse the entire 
heavy chain constant region to the PSTPIP sequence. However, more preferably, a sequence beginning in the 
hinge region just upstream of the papain cleavage site (which defines IgG Fc chemically; residue 216, taking 
the first residue of heavy chain constant region to be 1 14 [Kabat et al., supra ! . or analogous sites of other 
25 immunoglobulins) is used in the fusion. In a particularly preferred embodiment, the PSTPIP sequence (full 
length or fragment or variant) is fused to the hinge region and CH2 and CH3 or CH 1 , hinge, CH2 and CH3 
domains of an IgG-1, IgG-2, or IgG-3 heavy chain. The precise site at which the fusion is made is not critical, 
and the optimal site can be determined by routine experimentation. 

In some embodiments, the PSTPIP-immunoglobulin chimeras are assembled as multimers, and 
30 particularly as homo-dimers or -tetramers (WO 9 1 /08298). Generally, these assembled immunoglobulins will 
have known unit structures. A basic four chain structural unit is the form in which IgG, IgD, and IgE exist. 
A four unit is repeated in the higher molecular weight immunoglobulins; IgM generally exists as a pentamer 
of basic four units held together by disulfide bonds. IgA globulin, and occasionally IgG globulin, may also 
exist in multimeric form in serum. In the case of multimer, each four unit may be the same or different. 
35 Various exemplary assembled PSTPIP-immunoglobulin chimeras within the scope herein are 

schematically diagramed below: 

(a) AC L -AC L ; 

(b) AC H -[AC„, AC L -AC H , AC L -V H C H , or V L C L -AC H ]; 

(c) AC L -AC H -[AC L -AC H , AC L -V„C„, V L C L -AC H , or V L C L -V H C H ]; 
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(d) AC L -V H C H -[AC H> or AC L -V H C H , or V L C L -AC H ]; 

(e) V L C L -AC H -[AC L -V H C H , or V L C L -AC H ]; and 

(f) [A-Y] n -[V L C L -V H C H ] 2> 
wherein 

5 each A represents identical or different novel PSTPIP polypeptide amino acid sequences; 

is an immunoglobulin light chain variable domain; 
V H is an immunoglobulin heavy chain variable domain; 
C L is an immunoglobulin light chain constant domain; 
C|_| is an immunoglobulin heavy chain constant domain; 
10 n is an integer greater than 1 ; 

Y designates the residue of a covalent cross-linking agent. 

In the interests of brevity, the foregoing structures only show key features; they do not indicate 
joining (J) or other domains of the immunoglobulins, nor are disulfide bonds shown. However, where such 
domains are required for binding activity, they shall be constructed as being present in the ordinary locations 

15 which they occupy in the immunoglobulin molecules. 

Alternatively, the PSTPIP amino acid sequences can be inserted between immunoglobulin heavy 
chain and light chain sequences such that an immunoglobulin comprising a chimeric heavy chain is obtained. 
In this embodiment, the PSTPIP polypeptide sequences are fused to the 3' end of an immunoglobulin heavy 
chain in each arm of an immunoglobulin, either between the hinge and the CH2 domain, or between the CH2 

20 and CH3 domains. Similar constructs have been reported by Hoogenboom. H. R. et al„ MoL Immunol. 
28:1027-1037 (1991). 

Although the presence of an immunoglobulin light chain is not required in the immunoadhesins of 
the present invention, an immunoglobulin light chain might be present either covalently associated to a 
PSTPlP-immunoglobuiin heavy chain fusion polypeptide, or directly fused to the PSTPIP polypeptide. In 

25 the former case, DNA encoding an immunoglobulin light chain is typically coexpressed with the DNA 
encoding the PSTPIP-immunoglobulin heavy chain fusion protein. Upon secretion, the hybrid heavy chain 
and the light chain will be covalently associated to provide an immunoglobulin-Iike structure comprising two 
disulftde-1 inked immunoglobulin heavy chain-light chain pairs. Method suitable for the preparation of such 
structures are, for example, disclosed in U.S. Patent No. 4,81 6,567 issued 28 March 1989. 

30 In a preferred embodiment, the immunoglobulin sequences used in the construction of the 

immunoadhesins of the present invention are from an IgG immunoglobulin heavy chain constant domain. For 
human immunoadhesins, the use of human IgG-1 and IgG-3 immunoglobulin sequences is preferred. A major 
advantage of using IgG- 1 is that IgG-1 immunoadhesins can be purified efficiently on immobilized protein 
A. In contrast, purification of IgG-3 requires protein G, a significantly less versatile medium. However, other 

35 structural and functional properties of immunoglobulins should be considered when choosing the Ig fusion 
partner for a particular immunoadhesin construction. For example, the IgG-3 hinge is longer and more 
flexible, so it can accommodate larger 'adhesin' domains that may not fold or function properly when fused 
to IgG-1 . While IgG immunoadhesins are typically mono- or bivalent, other Ig subtypes like IgA and IgM may 
give rise to dimeric or pentameric structures, respectively, of the basic Ig homodimer unit. Multimeric 
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immunoadhesins are advantageous in that they can bind their respective targets with greater avidity than their 
IgG-based counterparts. Reported examples of such structures are CD4-IgM (Traunecker et al., supra ): ICAM- 
IgM (Martin et al., J, Virol 67:3561-68 [1993]); andCD2-IgM (Arulanandam etal., J. Exp. Med. 177:1439-50 
[1993]). 

5 For PSTPIP-lg immunoadhesins, which are designed for in vivo application, the pharmacokinetic 

properties and the effector functions specified by the Fc region are important as well. Although IgG- 1 , IgG-2 
and IgG-4 all have in vivo half-lives of 21 days, their relative potencies at activating the complement system 
are different. IgG-4 does not activate complement, and IgG-2 is significantly weaker at complement activation 
than IgG- 1 . Moreover, unlike IgG-1, IgG-2 does not bind to Fc receptors on mononuclear cells or neutrophils. 
1 0 While IgG-3 is optimal for complement activation, its in vivo half-life is approximately one third of the other 
IgG isotypes. Another important consideration for immunoadhesins designed to be used as human 
therapeutics is the number of allotypic variants of the particular isotype. In general, IgG isotypes with fewer 
serologically-defined allotypes are preferred. For example, IgG- 1 has only four seroiogically-defined allotypic 
sites, two of which (Glm and 2) are located in the Fc region; and one of these sites Glmh is non- 
15 immunogenic. In contrast, there are 12 serologically-defined allotypes in IgG-3, all of which are in the Fc 
region; only three of these sites (G3m5, 1 1 and 21) have one allotype which is nonimmunogenic. Thus, the 
potential immunogenicity of a y3 immunoadhesin is greater than that of a y 1 immunoadhesin. 

PSTPIP-lg immunoadhesins are most conveniently constructed by fusing the cDNA sequence 
encoding the PSTPIP portion in-frame to an Ig cDNA sequence. However, fusion to genomic Ig fragments 
20 can also be used (see, e.g. Gascoigne et al.. Proc. Nad. Acad. Sci. USA 84:2936-2940 [1987]; Aruffo et aL, 
Ce//6l:1303-I313 [1990]; Stamenkovic et aL, Cell 66: 1 133-1 144 [1991]). The latter type of fusion requires 
the presence of Ig regulatory sequences for expression. cDNAs encoding IgG heavy-chain constant regions 
can be isolated based on published sequence from cDNA libraries derived from spleen or peripheral blood 
lymphocytes, by hybridization or by polymerase chain reaction (PCR) techniques. 
25 H. Anti-PSTPIP Antibody Preparation 

(i) Polyclonal antibodies 

Polyclonal antibodies to a PSTPIP molecule generally are raised in animals by multiple 
subcutaneous (sc) or intraperitoneal (ip) injections of the PSTPIP and an adjuvant. It may be useful to 
conjugate the PSTPIP or a fragment containing the target amino acid sequence to a protein that is 
30 immunogenic in the species to be immunized, e.g. keyhole limpet hemocyanin, serum albumin, bovine 
thyroglobulin, or soybean trypsin inhibitor using a bifunctional or derivatizing agent, for example 
maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide 
(through lysine residues), glutaraldehyde, succinic anhydride, SOCl 2 , or R 1 N=C=NR, where R and R 1 are 
different alkyl groups. 

35 Animals are immunized against the immunogenic conjugates or derivatives by combining 1 mg or 

1 fig of conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's complete adjuvant and 
injecting the solution intradermally at multiple sites. One month later the animals are boosted with 1/5 to 1/10 
the original amount of conjugate in Freund's complete adjuvant by subcutaneous injection at multiple sites. 
7 to 14 days later the animals are bled and the serum is assayed for anti-PSTPIP antibody titer. Animals are 
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boosted until the titer plateaus. Preferably, the animal boosted with the conjugate of the same PSTPIP 
polypeptide, but conjugated to a different protein and/or through a different cross-linking reagent. Conjugates 
also can be made in recombinant cell culture as protein fusions. Also, aggregating agents such as alum are 
used to enhance the immune response. 

(ii) Monoclonal antibodies 

Monoclonal antibodies are obtained from a population of substantially homogeneous antibodies, 
i.e., the individual antibodies comprising the population are identical except for possible naturally-occurring 
mutations that may be present in minor amounts. Thus, the modifier "monoclonal" indicates the character of 
the antibody as not being a mixture of discrete antibodies. 

For example, the anti-PSTPIP monoclonal antibodies of the present invention may be made using 
the hybridoma method first described by Kohier & Milstein, Nature 256:495 (1975), or may be made by 
recombinant DNA methods [Cabilly et aL, U.S. Pat. No, 4,816,567]. 

In the hybridoma method, a mouse or other appropriate host animal, such as hamster is immunized 
as hereinabove described to elicit lymphocytes that produce or are capable of producing antibodies that will 
specifically bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro. 
Lymphocytes then are fused with myeloma cells using a suitable fusing agent, such as polyethylene glycol, 
to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, pp.59- 103 (Academic 
Press, 1986)]. 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfused, parental 
myeloma cells. For example, if the parental myeloma cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine (HAT medium), which substances prevent the growth of HGPRT- 
deficient cells. 

Preferred myeloma cells are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. 
Among these, preferred myeloma cell lines are murine myeloma lines, such as those derived from MOPC-21 
and MPC-1 1 mouse tumors available from the Salk Institute Cell Distribution Center, San Diego, California 
USA, and SP-2 cells available from the American Type Culture Collection, Rockville, Maryland USA. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of 
human monoclonal antibodies [Kozbor,7. Immunol 133:3001 (1984); Brodeur et aL. Monoclonal Antibody 
Production Techniques and Applications, pp.5 1-63 (Marcel Dekker, Inc.. New York, 1987)]. 

Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against a PSTPIP polypeptide. Preferably, the binding specificity of monoclonal antibodies 
produced by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (R1A) or enzyme-linked immunoabsorbent assay (ELISA). The binding affinity of the 
monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson & Pollard, Anal. 
Biochem. 107:220(1980). 
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After hybridoma cells are identified that produce antibodies of the desired specificity, affinity, 
and/or activity, the clones may be subcloned by limiting dilution procedures and grown by standard methods. 
Goding, Monoclonal Antibodies: Principles and Practice, pp.59- 1 04 (Academic Press, 1 986). Suitable culture 
media for this purpose include, for example, Dulbecco's Modified Eagle's Medium or RPM1-1640 medium. 
5 In addition, the hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture 
medium, ascites fluid, or serum by conventional immunoglobulin purification procedures such as. for example, 
protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity 
chromatography. 

1 o DNA encoding the monoclonal antibodies of the invention is readily isolated and sequenced using 

conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes 
encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then 
transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that 

15 do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence 
for human heavy and light chain constant domains in place of the homologous murine sequences, Morrison 
et al., Proc. Nat. Acad. Set. 81:6851 (1984), or by covalently joining to the immunoglobulin coding sequence 
all or part of the coding sequence for a non-immunoglobulin polypeptide. In that manner, "chimeric" or 

20 "hybrid" antibodies are prepared that have the binding specificity of an anti-PSTPIP monoclonal antibody 
herein. 

Typically such non-immunoglobulin polypeptides are substituted for the constant domains of an 
antibody of the invention, or they are substituted for the variable domains of one antigen-combining site of 
an antibody of the invention to create a chimeric bivalent antibody comprising one antigen-combining site 
25 having specificity for a PSTPIP polypeptide and another antigen-combining site having specificity for a 
different antigen. 

Chimeric or hybrid antibodies also may be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins may be 
constructed using a disulfide exchange reaction or by forming' a thioether bond. Examples of suitable reagents 
30 for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate. 

For diagnostic applications, the antibodies of the invention typically will be labeled with a 
detectable moiety. The detectable moiety can be any one which is capable of producing, either directly or 
indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such as 3 H, ,4 C, 
32p 35<^ or I25j a fl uor escent or cherniluminescent compound, such as fluorescein isothiocyanate. 
35 rhodamine, or luciferin; biotin or an enzyme, such as alkaline phosphatase, p-galactosidase or horseradish 
peroxidase. 

Any method known in the art for separately conjugating the antibody to the detectable moiety may 
be employed, including those methods described by Hunter et al., Nature 144:945 (1962); David et al., 
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Biochemistry 13:1014 (1974); Pain ct al., J. Immunol. Meth. 40:219 (1981) and Nygren, J. Histochem. and 
Cytochem. 30:407 (1982). 

The antibodies of the present invention may be employed in any known assay method, such as 
competitive binding assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, 
5 Monoclonal Antibodies: A Manual of Techniques, pp. 147- 1 58 (CRC Press, Inc., 1 987). 

Competitive binding assays rely on the ability of a labeled standard (which may be a PSTPIP 
polypeptide or an immunologically reactive portion thereof) to compete with the test sample analyte (PSTPIP) 
for binding with a limited amount of antibody. The amount of PSTPIP polypeptide in the test sample is 
inversely proportional to the amount of standard that becomes bound to the antibodies. To facilitate 

10 determining the amount of standard that becomes bound, the antibodies generally are insolubilized before or 
after the competition, so that the standard and analyte that are bound to the antibodies may conveniently be 
separated from the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different 
immunogenic portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte 

15 is bound by a first antibody which is immobilized on a solid support, and thereafter a second antibody binds 
to the analyte. thus forming an insoluble three part complex. David & Greene, U.S. Pat No. 4,376, 1 10. The 
second antibody may itself be labeled with a detectable moiety (direct sandwich assays) or may be measured 
using an anti-immunoglobulin antibody that is labeled with a detectable moiety (indirect sandwich assay). For 
example, one type of sandwich assay is an ELISA assay, in which case the detectable moiety is an enzyme. 

20 (iii) Humanized antibodies 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These 
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an 
"import" variable domain. Humanization can be essentially performed following the method of Winter and 

25 co-workers [Jones et aL Nature 321:522-525 (1986); Riechmann et al.. Nature 332:323-327 (1988); 
Verhoeyen et aL. Science 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 
antibodies (Cabilly, supra ), wherein substantially less than an intact human variable domain has been 
substituted by the corresponding sequence from anon-human species. In practice, humanized antibodies are 

30 typically human antibodies in which some CDR residues and possibly some FR residues are substituted by 
residues from analogous sites in rodent antibodies. 

It is important that antibodies be humanized with retention of high affinity for the antigen and other 
favorable biological properties. To achieve this goal, according to a preferred method, humanized antibodies 
are prepared by a process of analysis of the parental sequences and various conceptual humanized products 

35 using three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin 
models are commonly available and are familiar to those skilled in the art. Computer programs are available 
which illustrate and display probable three-dimensional conformational structures of selected candidate 
immunoglobulin sequences. Inspection of these displays permits analysis of the iikely role of the residues in 
the functioning of the candidate immunoglobulin sequence, i.e. the analysis of residues that influence the 
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ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues can be selected and 
combined from the consensus and import sequence so that the desired antibody characteristic, such as 
increased affinity for the target antigen(s), is achieved. In general, the CDR residues are directly and most 
substantially involved in influencing antigen binding. For further details see U.S. application Serial No. 
5 07/934,373 filed 21 August 1992, which is a continuation-in-part of application Serial No. 07/715,272 filed 
14 June 1991. 

Alternatively, it is now possible to produce transgenic animals (e.g. mice) that are capable, upon 
immunization, of producing a full repertoire of human antibodies in the absence of endogenous 
immunoglobulin production. For example, it has been described that the homozygous deletion of the antibody 
1 0 heavy chain joining region (J H ) gene in chimeric and germ-line mutant mice results in complete inhibition of 
endogenous antibody production. Transfer of the human germ-line immunoglobulin gene array in such germ- 
line mutant mice will result in the production of human antibodies upon antigen challenge. See, e.g. 
Jakobovits et aL, Proc. Natl Acad Sci USA 90:2551-255 (1993); Jakobovits et al., Nature 362:255-258 
(1993). 

15 (iv) Bispecific antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for a 
PSTPIP polypeptide, the other one is for any other antigen. Methods for making bispecific antibodies are 
known in the art. 

20 Traditionally, the recombinant production of bispecific antibodies is based on the coexpression of 

two immunoglobulin heavy chain-light chain pairs, where the two heavy chains have different specificities 
(Millstein and Cuello, Nature 305:537-539 (1983)). Because of the random assortment of immunoglobulin 
heavy and light chains, these hybridomas (quadromas) produce a potential mixture of 10 different antibody 
molecules, of which only one has the correct bispecific structure. The purification of the correct molecule, 

25 which is usually done by affinity chromatography steps, is rather cumbersome, and the product yields are low. 
Similar procedures are disclosed in PCT application publication No. WO 93/08829 (published 1 3 May 1993), 
and in Traunecker et al., EMBO J. 1 0:3655-3659 ( 1 99 1). 

According to a different and more preferred approach, antibody variable domains with the desired 
binding specificities (antibody-antigen combining sites) are fused to immunoglobulin constant domain 

30 sequences. The fusion preferably is with an immunoglobulin heavy chain constant domain, comprising at least 
part of the hinge, and second and third constant regions of an immunoglobulin heavy chain (CH2 and CH3). 
It is preferred to have the first heavy chain constant region (CHI) containing the site necessary for light chain 
binding, present in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, 
if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are cotransfected 

35 into a suitable host organism. This provides for great flexibility in adjusting the mutual proportions of the 
three polypeptide fragments in embodiments when unequal ratios of the three polypeptide chains used in the 
construction provide the optimum yields. It is, however, possible to insert the coding sequences for two or 
all three polypeptide chains in one expression vector when the expression of at least two polypeptide chains 
in equal ratios results in high yields or when the ratios are of no particular significance. In a preferred 
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embodiment of this approach, the bispecific antibodies are composed of a hybrid immunoglobulin heavy chain 
with a first binding specificity in one arm, and a hybrid immunoglobulin heavy chain-light chain pair 
(providing a second binding specificity) in the other arm. It was found that this asymmetric structure 
facilitates the separation of the desired bispecific compound from unwanted immunoglobulin chain 
5 combinations, as the presence of an immunoglobulin light chain in only one half of the bispecific molecule 
provides for a facile way of separation. This approach is disclosed in PCT application WO 94/04690 
published 3 March 1994. 

For further details of generating bispecific antibodies see, for example, Suresh et al.. Methods in 
Enzymology 1 2 1 :2 1 0 ( 1 986). 

10 (v) Heteroconjugate antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been 
proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of 
HIV infection (PCT application publication Nos. WO 91/00360 and WO 92/200373; EP 03089). 

1 5 Heteroconjugate antibodies may be made using any convenient cross-linking methods. Suitable cross-linking 
agents are well known in the art, and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross- 
linking techniques. 

1. Peptide and Non-Peptide Analogs of PSTPIP Polypeptides 

Peptide analogs of the PSTPIP polypeptides of the present invention are modeled based upon the 

20 three-dimensional structure of the native polypeptides. Peptides may be synthesized by well known 
techniques such as the solid-phase synthetic techniques initially described in M err i fie Id, J. Am. Chem. Soc. 
15:2149-2154 (1963). Other peptide synthesis techniques are, for examples, described in Bodanszky et al., 
Peptide Synthesis* John Wiley & Sons, 2nd Ed., 1976, as well as in other reference books readily available 
for those skilled in the art. A summary of peptide synthesis techniques may be found in Stuart and Young, 

25 Solid Phase Peptide Synthesis, Pierce Chemical Company, Rockford, 1L (1984). Peptides may also be 
prepared by recombinant DNA technology, using a DNA sequence encoding the desired peptide. 

In addition to peptide analogs, the present invention also contemplates non-peptide (e.g. organic) 
compounds which display substantially the same surface as the peptide analogs of the present invention, and 
therefore interact with other molecules in a similar fashion. • 

30 J. Use of the PSTPIP Polypeptides 

The PSTPIP polypeptides of the present invention are useful for a variety of purposes. For 
example, the PSTPIP polypeptide shown in Fig. 1A is useful for identifying and isolating a PSTPIP 
homologue in another mammalian species. Native PSTPIP polypeptides and their functional equivalents are 
also useful in screening assays to identify antagonists or agonists of the native PSTPIP polypeptide. Such 

35 assays may take the form of any conventional cell-type or biochemical binding assay, and can be performed 
in a variety of different assay formats well known to those skilled in the art. 

The PSTPIP polypeptides of the present invention are shown herein to be involved with the 
polymerization of actin monomers in eukaryotic cells. As such, the PSTPIP polypeptides are useful in a 
variety of indications wherein the skilled artisan wishes to induce the polymerization of actin monomers. 
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The PSTPIP polypeptides of the present invention and the nucleic acids encoding them are also 
useful as molecular markers of the tissues in which they are specifically expressed. As such, the PSTPIP 
polypeptides and nucleic acids encoding them are useful for tissue typing of specific mammalian tissues. 

The PSTPIP polypeptides of the present invention are also useful as protein molecular weight 
5 markers on protein gels. 

Nucleic acids encoding the PSTPIP polypeptides of the present invention are also useful for the 
preparation of PSTPIP polypeptides by recombinant techniques exemplified herein and in providing 
hybridization probes for searching cDNA and genomic libraries for the coding sequence of other PSTPIP 
polypeptides analogs in other species. 
1 0 Antagonists of the PSTPIP polypeptides of the present invention are useful for inhibiting at least 

one biological activity of the polypeptides. 

Further details of the invention are illustrated in the following non-limiting examples. 
The PSTPIP polypeptides of the present invention can also be used in in vitro assays together with 
PTP HSCF, to identify inhibitors of the PTP-PSTPIP interaction. Such inhibitors may. for example, be 
1 5 polypeptides, peptides or small (organic) molecules, which inhibit the PTP-PSTPIP interaction by binding to 
PSTPIP and/or to PTP HSCF. Similar assays can be used to find enzymatic inhibitors of dephosphorylation 
of the phosphatase. Such inhibitors may be useful as chemotherapeutic agents, which are abie to stop or 
inhibit the cell division of tumor cells. 

Mutants (amino acid sequence variants) of native PSTPIP polypeptides can be used in vivo in 
20 transfected recombinant host cells to identify other components of the cell divisional machinery. In addition, 
regions of PSTPIP can be used in vivo in the yeast two-hybrid system, or in any functionally similar assay 
configuration, to identify other interacting proteins potentially involved with cell division. 

Antibodies specifically binding PSTPIP can be used, for example, to identify rapidly dividing cells, 
which, in turn, are used to image tumors comprised of such rapidly dividing cells. 
25 Nucleic acid ancoding native PSTPIP molecules can be used to isolate homologous genes 

specifically expressed in tumor cells, which might provide more specific targets for tumor therapy. 
K. Materials and Methods 

1. Two-Hybrid Screening Assay 

The yeast two-hybrid screening assay was performed essentially as described (Chien et al., Proc. 

30 Natl. Acad. Set. USA 88:9578-9582 (1991) and Bartel et al., Methods EnzymoL 254:241-263 (1995)). A 
C 2 2i"S active site mutant of PTP HSCF (Cheng et al., (1996) supra ) was cloned in frame with the Gal4 
binding domain in the plasmid pPC97. A library of 6 x 10 6 individual clones was produced from Baf3 
lymphoid progenitor cells in the Gal4 activation domain plasmid pPC86 using standard procedures. Yeast 
were transformed with both plasmids and were incubated on histidine minus plates for 3 days at 30° C. 

35 Colonies which grew under these conditions were restreaked onto histidine minus plates and were tested for 
P-galactosidase activity (Bartel et al., (1995) supra ). Colonies which manifested various levels of P- 
galactosidase activity were isolated, and the cDNA inserts in the pPC86 vector were isolated by PCR and 
sequenced using standard procedures. Clones encoding PSTPIP were tested for dependence on the PTP 
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interaction by transfection into cells with and without the original PTP HSCF containing pPC97 plasmid and 
subsequent analysis for growth on histidine minus plates and p-galactosidase activity. 
2. Mapping of Interaction Domains 

To obtain a cDNA encoding full-length PSTPIP tagged with the FLAG epitope (DYKDDDDK) 
5 (SEQ ID NO:8) at the C-terminus, PCR was performed using primers 48.BAMHLF (CGCGGATCCACCA 
TGATGGCCCAGCTGCAGTTC) (SEQ ID NO:9) and 48.SALFLAG.R 

(GTACGCGTCGACTCACTTGTCATCGTCGTCC TTGTAGTCGAGCTT) (SEQ ID NO: 10). The resulting 
PCR fragment was digested with BamHI and Sal I and subcloned into the BamHI and Sal I sites of pRK.tkneo, 
an expression plasmid containing the cytomegalovirus promoter, thus creating plasmid pRK.PIP.FLAG.C. 

1 0 The PTP HSCF deletion mutants were derived from a construct containing the influenza hemagglutinin epitope 
at its N-terminus and were made as follows: PCR was performed on PRK.HSCF using primers prkr 
(TGCCTTTCTCTCCACAGG) (SEQ ID NO:l 1) and 38.spe.mid.R (CTCCTTGAGGTTCTACTAGTGGGGG 
CTGGTGTCCTG) (SEQ ID NO:12). The resulting PCR fragment encoding the phosphatase domain (amino 
acids 1-312) was digested with Cla I and Spe 1 and subcloned into pRK.tk.neo digested with Cla I and Xba 

15 I resulting in plasmid pRK.hscf.ptp domain. Similarly, PCR using primers prkr and 39. spe endR 
(G CG G CC G C A CT A G TATCC AG TCTG TGCTCCATCTGTTAC) (SEQ ID NO: 13) was performed and the 
resulting fragment encoding amino acids 1-439 of hscf was digested with Cla I and Spe I and subcloned into 
the Cla I and Xba I sites of pRKtkneo. GST fusion proteins were prepared essentially according to the 
manufacturer (Pharmacia Biotech) in DH5-alpha bacterial cells. A Sal I to Not I fragment containing the 

20 full-length cDNA for PSTPIP (amino acids 2-4 1 5) was subcloned into pGEX-4T-2 (Pharmacia) cleaved at the 
Sal 1 and Not I sites. 

To obtain a DNA fragment encoding the coiled-coil domain of PSTPIP, PCR was performed using 
primers PC86F (GCGTTTGGAATCA CTAC) (SEQ ID NO:14) and pip48.1706R 
(TTATAGTTTAGCGGCCGC TCACCGGTAGTCCTGGGCTGATG) (SEQ ID NO: 15). The PCR fragment 

25 was digested with Sal I and Not I and subsequently cloned into the Sal I and Not I sites of pGEX-4T-2. 

To obtain a cDNA fragment encoding the SH3 domain of PSTPIP, PCR was performed using 
primers pip48.1673.F (GTACG CGTCG ACC GCACTCTACGACTACACTGCACAG) (SEQ ID NO:16) and 
PC86R (CTCTG G CG A AG A AGTCC) (SEQ ID NO: 17) and the resulting product was digested with Sal I and 
Not 1 and subcloned into the Sal 1 and Not I sites of pGEX-4T-2. To obtain a cDNA fragment encoding the 

30 PST (and C-terminal homology) of PTP HSCF (amino acids 304-453), PCR was performed using primers 
PST38-RI (GATCGAATTCCCAGAACCTCAA GGAGAACTGC) (SEQ ID NO:18) and PST38-XHOI 
(GATCCTCGAGTTACACCCGTGTCCACTCTGCTGGAGGA) (SEQ ID NO: 19). The resulting PCR 
product was digested with EcoRl and Xhol and subcloned into the EcoRI and Sal sites of pGEX-4T-2. Protein 
determinations were carried out according to the Couprus assay with a kit from Geno Technology (St Louis). 

35 The binding was carried out according to the method of Wong and Johnson (Wong et al., J. BioL 

Chem. 27 1(35):2098 1-20984 (1996)). Briefly, 1 ug of plasmid with either the PSTPIP protein or PTP HSCF 
under the control of the Sp6 promoter was in vitro transcribed/translated using the Promega TnT Rabbit 
Reticulocyte system. Samples were diluted in 50 mM HEPES, pH 7.2, 1% triton X 100. 10% glycerol, 
lOOmM NaCI, 5mM EDTA and 2 ug/ml each of leupeptin, pepstatin, aprotinin, and PMSF. Samples were 
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pre-cleared with resin for 1 hour and 1 ug GST-fusion protein was added along with 30 of GSH-Sepharose 
that was previously blocked in 3% BSA for 1 hour. This was reacted for 1 hour at 4°C and then the resin 
washed 6 times in HEPES/Triton binding buffer before SDS gel electrophoresis. The peptides were 
synthesized on an automated Milligen 9050 Peptide Synthesizer using standard solid phase chemistry with 
5 FMOC protected amino acids on a p-alkoxybenzyl alcohol resin. Dried peptides were re-suspended in the 
HEPES/Triton Binding buffer at a concentration of 10 mg/ml. Peptide inhibition was performed by adding 
the peptide first to the in vitro translation product and then the GST-fusion followed by the GSH-Sepharose. 
The binding/washing steps were as previously described. The peptides synthesized and the PTPs they were 
derived from were: 

10 PXXP-HSCF: 432 GFNLRIGRPKGPRDPPAEWT 451 (PTP HSCF) (SEQ ID NO:20), 
PXXP-PEP: 782 GFGNRFSKPKGPRNPPSAW 800 (PTP PEP) (SEQ ID NO:21), 
PXXP-PEST: 76I GFGNRCGKPKGPRDPPSEWT 780 (PTP PEST) SEQ ID NO:22), 
PXXP-CONTROL: 3 34GGVLRSISVPAPPTLPMADT 35 3 (PTP HSCF) (SEQ ID NO:23). 
3. Analysis of Tyrosine Phosphorylation 

15 BaO cells were lysed in 1% Triton, 50mM HEPES. 10% glycerol and 5mM EDTA containing 1 

ug/ml aprotinin, PMSF, leupeptin and pepstatin with ImM Sodium Vanadate and iOmM lodoacctic acid. 
Cells were treated with 0. ImM Pervanadate for 4 hours before lysis. Immunoprccipitations were performed 
in the vanadate-containing lysis buffer using 1 Mg/ml anti-PSTPIP polyclonal antibody and 400 ug of Iysate 
protein at 4°C overnight. Western blots were performed using I u.g/ml affinity purified anti-PSTPIP or 1 :5000 

20 dilution of commercial 4G10 anti-phosphotyrosine monoclonal (Upstate Biotech). Signal was detected by 
HRPO - ECL reagents (Pierce). The C 2 2i~S mutant was as previously described (Cheng et aL (1996) supra ). 
The PTP HSCF D 197 -A mutant was generated using PCR. Mutagenesis primer D197A.F 
(GTATATGTCCTGGCCAGCCCATGGGGTTCCCAGCAG) (SEQ ID MO:24), corresponding to nucleotide 
591, and primer D197A.R (GCAGGTCGACTCTAGATTACACCCGTGTCCACTCTG) (SEQ ID NO:25) 

25 which corresponds to the stop codon, were used in PCR to generate a fragment that could be cut with Mscl 
and XbaL pRK.HA.38 WT ? a plasmid which encoded the wild type enzyme under the control of the 
cytomegalovirus promoter (Cheng et aL, (1996) supra l was digested with Clal and Mscl and the resulting 
600bp fragment was ligated with the Mscl-Xbal PCR fragment into the Clal and Xbal sites of pRK.tkneo. A 
plasmid encoding the V-src oncogene under the control of the SV40 early promoter was the kind gift of Dr. 

30 Art Levinson (CEO-Genentech, Inc.). NIH 3T3 ceils and COS-7 ceils were cultured in high glucose DMEM 
supplemented with 10% FBS, 2mM L-Glutamine, IOmM HEPES pH 7.2 and pen-strep. 

COS-7 ceils were transfected by electroporation. Briefly, 1.5 x 10 6 COS-7 cells were mixed with 
24 ug total DNA in PBS and electroporated at 960uF, 0.22 volts (Bio-Rad Gene Pulsar). Following 
electroporation, ceils were seeded in 10cm dishes and incubated for 3 days. 10-cm dishes of transfected COS 

35 cells were washed twice with ice-cold PBS, and lysed in 1ml of M-RIPA (50mM Tris 7.4, 1% NP40, 0.25% 
DOC, 150mM NaCI, ImM sodium ortho-vanadate, ImM NaF plus Complete™ Protease Inhibitors 
(Boehringer Mannheim)). Lysates were incubated for 15 minutes with 100fil UltraLink Immobilized Protein 
A/G (Pierce) at 4°C > followed by centrifugation for 5 minutes. Supernatants were collected and stored at 
-70°C or directly immunoprecipitated. 5 ug of M2 or I2CA5 was added to 500 ul of lysate and incubated 
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overnight at 4°C. Ultralink Protein A/G was added and incubation continued for 2 hours at 4°C The immune 
complexes were washed 3 times with M-RIPA. The proteins were subjected to SDS-PAGE and transferred 
to nitrocellulose in IX Transfer Buffer (Novex). Immunoblots were blocked overnight at 4°C in 3% 
milk/PBS. To detect Flag-tagged PIP, blots were incubated with lOug/ml Bio-M2 (Biotinylated anti-FLAG 
5 monoclonal Ab, KODAK) followed by incubation in lOug/ml streptavidin-HRP (UBI). To detect HA-tagged 
PTPhscf, blots were incubated in anti-(HA)-peroxidase (Boehringer Mannheim) as per manufacturer's 
instructions. To detect phosphotyrosine, blots were incubated in HRP-conjugated 4G 10 (anti-phosphotyrosine 
monoclonal, UBI) as per manufacturer's instructions. 

4. Confocal Microscopy of Endogenous and Translated PSTPIP. 

10 Rabbit polyclonal antibodies were produced against a GST-PSTPIP fusion protein. The complete 

PSTPIP-GST fusion protein was purified on GSH-sepharose and injected intramuscularly at 2 sites with 200 
ug fusion protein and subcutaneously at multiple sites with a total of 300 ug PSTPIP-GST fusion protein in 
Complete Freunds Adjuvant. Rabbits were boosted every 3 weeks with 100 pg fusion protein in Incomplete 
Freunds. 15 ml of rabbit sera was reacted with 0.5mg PSTPIP-GST-GSH-Sepharose for 3 hours at 4°C with 

15 gentle rotation. The resin was collected by centrifugation and washed with 10 column volumes of PBS. 
Immunoglobulin was eluted from the affinity matrix with lOOmM acetic acid, 500mM NaCI, neutralized with 
NaOH, and then dialyzed overnight with PBS. MIH 3T3 celts were seeded at 100,000 cells per chamber slide 
and allowed to adhere overnight. The cells were transfected using Lipofectamine (2ug pRK.PIP.FLAG.C/12 
ui Lipofectamine in 0.8ml OPTI-MEM) for 5 hours. The DNA/Lipofectamine solution was removed and fresh 

20 serum containing medium added. 48 hours following the start of transfection, the cells were fixed in 4% 
formaldehyde in PHEM 6.1 (60mM PIPES, 25mM HEPES, lOmM EGTA and 2mM MgCI 2 ) for 20 minutes, 
then permeabilized in 0.2% Triton X-100, 300mM sucrose in PHEM 6.9 for 10 minutes. The cells were 
washed twice in PHEM 6.9 and then incubated with 10% FBS/PHEM 6.9 for 1 hour to block non-specific 
binding of the antibody. Cells were incubated for 1 hour in 2% BSA/PHEM 6.9 containing lOug/rnl M2 

25 (KODAK, anti-FLAG monoclonal antibody) or lOug/ml 12CA5 (Boehringer Mannheim anti-HA monoclonal 
antibody) as an irrelevant antibody control. After washing cells twice with 2% BSA/PHEM6.9. cells were 
incubated with for 30 minutes with a 1 :2000 dilution of Cy 3 -conjugated AflnniPure sheep anti-mouse IgG and 
a 1:200 dilution of Fluorescein Phalloidin (Molecular Probes) in 2% BSA/PHEM 6.9. Cells were washed in 
2% BSA/PHEM6.9 and mounted in Vectashield Mounting Medium with DAPI. NIH3T3 cells were seeded 

30 at 200,000 cells per chamber slide and allowed to adhere overnight. Cells were stained with 0.4ug/ml rabbit 
anti-PIP or 0.4ug/ml rabbit IgG and detected with Cy3-conjugated goat anti-rabbit. Additionally, cells were 
co-stained with a 1:200 dilution of Fluorescein-Phalloidin. 
L. Examples 

EXAMPLE I - Identification of a PTP HCSF Binding Protein 

35 In order to identify potential substrates for PTP HSCF (Cheng et al., (1996) supra ), we performed 

a yeast two-hybrid screening assay using a cataiytically inactive form of the enzyme as bait and a library 
derived from murine BaO hematopoietic progenitor cells, a cell type that has been previously demonstrated 
to express high levels of this phosphatase (Cheng et al., (1996), supra ). This resulted in the isolation of 
approximately 70 yeast clones which grew in the absence of histidine and which expressed variable levels of 
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p-galactosidase. Sequence analysis of the clones revealed that approximately 40% encoded related sequences 
with slightly divergent 5' fusions with the Gal 4 DNA-binding domain. The sequences of the remainder of 
the clones suggested that they were likely due to artefactual interactions. Analysis of histidine growth and p- 
galactosidase expression of all two-hybrid clones containing these related sequences revealed an absolute 
5 dependence on the inclusion of the phosphatase bait construct in the same cells (data not shown). The longest 
two hybrid clone was used to isolate a full length cDNA from the original Baf3 two hybrid library. 

Fig. 1A illustrates that the protein which interacts with PTP HSCF is a novel 415 residue molecule 
(predicted molecular weight -47,590 D) (SEQ ID NO:l) with significant sequence homology to the S. pombe 
cell cycle protein. CDC15p (SEQ ID NO:26) ? a cytoskeletal interacting protein involved with organization of 

10 the actin ring at the cleavage furrow during cytokinesis (Fankhauser, Cell 82:435-444 (1995)). This homology 
(-26% sequence similarity) stretches over the entire length of both molecules, with the exception of a large 
insertion of approximately 500 residues in the yeast molecule that is not found in the mammalian protein, and 
the yeast protein is the highest scoring homologue in the protein sequence database. 

A number of features are conserved in these two proteins. For example, both have an SH3 domain 

15 at their carboxy termini (Feng et aL Proc. Natl. Acad. Scl USA 92:12408-12415 (1995) and Pawson, (1995) 
supra l and the mammalian SH3 domain appears to be homologous to those found in a number of known 
cytoskeletal regulatory proteins including myosin heavy chain, spectrin, fodrin, hematopoietic specific protein 
1 (HS1) and the p80/85 src substrate, cortactin (Fig. IB). In addition, both the mammalian and yeast 
(Fankhauser, ( 1 995) supra ) proteins contain a potential coiled coil domain at their N-termini which is predicted 

20 both on the basis of sequence homology as well as an analysis of the mammalian sequence using the Prostruct 
program (Fig. 1C). Within these coiled coil domains is a region with an extraordinary content of acidic and 
basic residues (positions 99-180 of the mammalian protein). Because the mammalian protein was isolated on 
the basis of an interaction with a tyrosine phosphatase, it is possible that the protein is tyrosine phosphorylated 
(see below), and examination of the mammalian and yeast sequences revealed 5 conserved tyrosine residues 

25 (positions 53. 191, 287, 367 and 369 of the mammalian protein). Finally, examination of the proteins for 
proline rich regions which might function as SH3 binding sites (PXXP) (SEQ ID NO:27) revealed two such 
conserved sites in these proteins (starting at positions 278 and 323 of the mammalian protein) (Feng et al., 
(1995) supra and Pawson, ( 1995) supra). p80/85 cortactin (Wu et aL, Mol Cell Biol 11:5113-5124 (1991)) 
and HS1 (Kitamura et al., Nuc. Acids. Res. 17:9367-9379 (1989)) are two other mammalian proteins that 

30 contain repeated coiled coil and SH3 domains and that bear a more distant relationship to the PTP HSCF 
interacting protein, although these two proteins contain homologous 37 amino acid repeats in their coiled coil 
regions which appear to be absent from the PTP interacting protein. Because the mammalian sequence was 
isolated based upon its ability to interact with the PEST phosphatase PTP HSCF, it has been termed PSTPIP 
(PST Phosphatase Interacting Protein). 

35 Northern blot analysis of the expression of PSTPIP during embryogenesis and in adult tissues is 

illustrated in Fig. 2. Interestingly, the protein is more highly expressed in the very early 7 day embryo as 
compared to later stages, and it appears to be significantly downregulated in the 1 1 day embryo (Fig. 2B). The 
protein is expressed at relatively high levels in adult lung and spleen and at lower levels in testis, muscle, 
kidney, brain and heart (Fig. 2A). However, the interacting protein is at far lower levels than actin, since the 
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actin blots were exposed for 4 hours versus the one week exposure for the PSTPIP blots. Previously, we and 
others have demonstrated that PTP HSCF is also expressed at detectable levels in both adult lung and kidney 
(Cheng et al., (1996) supra and Huang et al., (1996) supra ). 

RX AMPLE 2 - Characterization of the Interaction Between PTP HSCF and PSTPIP 
5 In order to characterize the regions involved with the binding between PTP HSCF and PSTPIP, a 

rapid and direct in vitro binding assay was performed. In this assay, various GST fusions of either the 
phosphatase or the interacting protein were used to precipitate in vitro translation products of the cognate 
binding proteins. Fig. 3 illustrates that precipitation of in vitro translated PTP HSCF by GST fusion proteins 
containing various SH3 domains as well as full length PSTPIP revealed a high degree of specificity in the 

1 0 interaction between the GST PSTPIP and the phosphatase. Fig. 3 also illustrates that at this concentration of 
GST fusion protein (-1 microgram per ml or -1.5 micromolar), the PSTPIP fusion protein appeared to be 
more efficient at precipitating the phosphatase than a polyclonal antibody directed against the enzyme or a 
monoclonal antibody directed against a hemagglutinin tag at the PTP N-terminus (data not shown). This result 
is consistent with a relatively high affinity interaction between the GST PSTPIP and the in vitro translated PTP 

15 HSCF (see below). 

The region of PTP HSCF which interacts with PSTPIP was identified by producing deletion 
mutants of the enzyme missing either the 20 amino acid C-terminal domain which is highly conserved in all 
of the PEST PTPs (Yang et al., (1993) supra, Matthews et al., (1992) supra , Cheng et al., ( 1 996) supra, Huang 
et al., ( 1 996) supra . Aoki et aL, ( 1 996) suora . Dosil et al., ( 1 996) supra and Kim et al., ( 1 996) supra ) or both 

20 this domain as well as the longer proline, serine and threonine rich region C-terminal to the catalytic domain 
(Fig. 4 A). Fig. 4B-C reveal that deletion of the C-terminal 20 amino acid homology domain of PTP HSCF 
completely abolished the interaction between PTP HSCF and PSTPIP. Because this region is conserved in 
all PEST PTPs, it is possible that both PTP PEST (Yang et al., (1993) supra ) as well as PTP PEP (Matthews 
et al., (1992) supra ) also interact with PSTPIP. In order to examine this possibility, as well as to examine if 

25 the 20 amino acid C-terminal region is sufficient for this interaction, 20 residue long peptides derived from 
the homologous C-terminal domain of three PEST PTPs were used to compete with the interaction between 
PTP HSCF and PSTPIP (Fig. 4D). In this form of the assay, a GST fusion derived from the PST-rich and C- 
terminal homology regions of the phosphatase was used to precipitate in vitro translated PSTPIP in the 
presence of varying amounts of peptides. Fig. 4D illustrates that all three peptides effectively block the 

30 interaction at concentrations as low as ~800nM, while a control peptide derived from a different proiine-rich 
region of PTP HSCF is completely unable to block the interaction. These data suggest that this small proline 
rich region of the PEST PTPs is sufficient for mediating the high affinity interaction between the phosphatase 
and PSTPIP, and furthermore indicate the possibility that all of these PTPs may interact with PSTPIP via the 
C-terminal homology domain. 

35 In order to examine the region of PSTPIP which interacts with the C-terminal homology region, 

GST fusions containing either the SH3 domain or the coiled coil domain of the interacting protein were used 
to immunoprecipitate in vitro translated PTP HSCF. The C-terminal homology region which interacts with 
PSTPIP contains 2 overlapping consensus SH3 (PXXP) (SEQ ID NO:27) binding sites, consistent with the 
possibility that the phosphatase- PSTPIP interaction was an SH3-type binding event (Pawson, ( 1 995) supra and 
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Feng et al., (1995) supra ). However, the affinity of the interaction as measured in the peptide experiment was 
significantly greater than many of those previously reported for SH3 domain-PXXP (SEQ ID NO:27) 
interactions (Feng et al., (1995) supra ), and as Fig. 5 illustrates, the interaction between these proteins was 
surprisingly mediated by the coiled coil domain and not the SH3 region. This outcome is consistent with the 
5 results of the two hybrid clones, all of which began at a site very close to the N-terminus of the coiled coil 
domain, suggesting that the PSTPIP site which interacts with the C-terminal proline rich domain includes the 
N-terminus. Thus, these data define a novel, apparently high affinity interaction between the C-terminal 
proline rich domain of PTP HSCF and the coiled coil region of PSTPIP. 
EXAMPLE 3 - PSTPIP is a Substrate for PTP HSCF Phosphatase Activity 

10 The association between PTP HSCF and PSTPIP suggested that the interacting protein might be 

a substrate for the phosphatase. In addition, the conservation of a number of tyrosines between PSTPIP and 
the highly phosphoryiated CDC 15 protein was also consistent with the possibility that the interacting protein 
was tyrosine phosphoryiated. As Fig. 6 demonstrates, endogenous PSTPIP is indeed tyrosine phosphoryiated 
in Baf 3 cells, and this phosphorylation was significantly enhanced by the tyrosine phosphatase inhibitor 

1 5 vanadate, consistent with the supposition that the protein is dephosphorylated in vivo by a PTP enzyme (Dixon, 
Ann. NY Acad Set 766:18-22 (1995)). 

A potential tyrosine kinase which might phosphorylate PSTPIP in vivo is src. Previous data 
suggested that the V-src tyrosine kinase is associated with the cytoskeleton, modulates cytoskeletal elements 
which resulted in profound morphological changes (Cooper et al., Cell 73: 105 1-1054 (1993), Kaplan et al., 

20 EMBO J. 13:4745-4756 (1994) and Thomas et al, Nature 376:267-271 (1995)) and mediates the tyrosine 
phosphorylation of p80/85 cortactin (Wu et al., (1991) supra, Okamura et al., J. Biol. CAeiw.270(44):26613- 
266 18(1 995), Vuori et al., J. Biol. Chem. 270(38):22259-22262 (1995) and Dehio et al., EMBO J. 14:2741- 
2782 (1995)), an SH3, coiled coil containing actin binding protein that bore a structural similarity to PSTPIP. 
In addition, HS1, another SH3 containing protein that is also structurally similar to PSTPIP, is tyrosine 

25 phosphoryiated by various SRC-family kinases (Yamanashi et al., Proc. Natl. Acad. Sci. USA 90:363 1-3635 
(1993), Nada et al., Oncogene 9:3571-3578 (1994), Takemoto et al., EMBO J. 14:3403-3414 (1995) and 
Takemoto et al.. Int. Immunol. 8(1 1): 1699- 1705 (1996)). These results implied that V-src, a constitutively 
active form of the enzyme, might mediate the tyrosine phosphorylation of PSTPIP, thus allowing for an 
analysis of the possible substrate interactions between the interacting protein and PTP HSCF. In order to test 

30 this possibility, PSTPIP was transfected into COS cells together with the V-src tyrosine kinase and either wild 
type or dominant negative forms of PTP HSCF. Dominant negative phosphatases were produced by mutating 
either the active site cysteine to a serine (C229-S), which abolishes the ability of the enzyme to form a covalent 
transition state intermediate with the phosphate attached to the tyrosine, or mutation of a critical active site 
aspartate residue to alanine (D197-A), which inhibits the catalytic removal of the phosphate (Dixon, (1995) 

35 supra . Jia etal.. Science 268(52 18): 1754- 1758 (1995) and Garton et al., (1996) sjmra). In both cases, these 
mutants will tightly bind to the substrate but not dephosphorylate it, with the result being that the substrate will 
be hyperphosphorylated. This procedure has been previously utilized to characterize substrates for a number 
of different PTPs, including PTP PEST (Garton, (1996) supra) and PTP SHP-2 (Herbst et al., Cell 85:899-909 
(1996)), and it has revealed that these mutant enzymes show exquisite substrate specificity in vivo. 
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As can be seen from Figs. 6B-F, PSTPIP is tyrosine phosphorylated in response to V-src 
cotransfection. Transfection of the wild type PTP HSCF into PSTPIP and V-src expressing ceils resulted in 
a decreased level of tyrosine phosphate on the interacting protein, consistent with the in vivo removal of the 
phosphate from PSTPIP tyrosines by the phosphatase enzyme, a result that would be expected if the interacting 
5 protein were a substrate for the enzyme. Even more compellingly, Figs. 6B-F also illustrates that 
cotransfection of either dominant negative form of PTP HSCF into PSTPIP and V-src transfected cells resulted 
in a dramatic increase in the levels of tyrosine phosphate on the interacting protein. It appears that the Di 97-A 
mutation was a slightly more efficient dominant negative protein than the C229~S mutant, consistent with 
results found using dominant negative forms of PTP PEST interacting with one of its substrates, pl30CAS 

1 0 (Garton, ( 1 996) supra ). 

These results, in addition to the in vitro binding studies, suggest a direct physical interaction 
between PSTPIP and PTP HSCF, and Figs. 6B-F also illustrate the in vivo physical association of these 
proteins by demonstrating the coprecipitation of either PSTPIP or PTP HSCF by antibodies to epitope tags 
on the cognate binding proteins. These data are thus consistent with the conclusion that PSTPIP interacts with 

15 PTP HSCF in vivo, and that this interaction allows the phosphatase to dephosphorylate tyrosine residues 
modified by the V-src kinase. In addition, because tyrosine phosphorylated PSTPIP was only observed in cells 
that were transfected with V-src, these data also suggest that COS cells may be deficient in the cellular kinase 
which tyrosine phosphorylates PSTPIP, or that the dramatic overexpression of the protein in these ceils 
overwhelmed the endogenous tyrosine phosphorylation mechanism. 

20 EXAMPLE 4 - Subcellular Localization of PSTPIP 

S. pombe CDC1 5p is associated with the conical actin cytoskeleton until it migrates to a region over 
the postmitotic nucleus and initiates formation of the actin-rich cleavage furrow (Fankhauser, (1995) supra ). 
The protein remains associated with the cleavage furrow until the completion of cell division, when it migrates 
back to the region of the cell containing cortical actin. In order to analyze the subcellular localization of 

25 endogenous PSTPIP, 3T3 cells were stained with an affinity purified polyclonal antibody directed against a 
GST fusion of the protein and were imaged using confocal microscopy. Fig. 7 illustrates that the interacting 
protein is colocalized to several actin-containing sites in the cell. A large portion of the protein appears to be 
associated with the cortical actin cytoskeleton on the intracellular side of the plasma membrane. The protein 
also appears to co-localize with the actin stress fibers as well as in lameilipodial regions of the cell containing 

30 actin. 

In addition, transfection of PSTPIP into CHO cells revealed expression at sites of focal contact (data 
not shown). These results are in contrast with the PSTPlP-related protein p80/85 cortactin, which shows 
localization on cortical actin and at the ends of the stress fibers but not the fibers themselves (Wu et al., ( 1 99 1 ) 
supra ). As is the case with S.pombe CDCISp (Fankhauser, (1995) supra ), these data suggest that PSTPIP is 
35 associated with cytoskeletal actin during the non-cytokinetic stages of the cell cycle. 

Importantly, examination of cells undergoing cytokinesis reveals that endogenous PSTPIP is 
predominantly associated with the cleavage furrow (Fishkind et al.. Current Opinion in Cell Biology 7:23-3 1 
(1995) and Fankhauser, (1995) supra ). As Fig. 7 shows, both PSTPIP and the actin ring co-localize to this 
region of the dividing cells. Fig. 7 also illustrates that the PSTPIP in the cleavage furrow is predominantly 
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associated with the membrane bound F actin which acts to constrict the cleavage furrow (Fishkind, (1995) 
supra ), and examination of sections, taken perpendicular to the cleavage furrow support this, showing donui- 
like structure containing both PSTPIP and actin attached to the constricting plasma membrane of the cleavage 
furrow (data not shown). It also appears from Fig. 7 that much of the cortical ly associated actin and PSTPIP 
5 migrate to the cleavage furrow during cytokinesis, a result that is remarkably similar to that observed for yeast 
CDC15p and actin (Fankhauser, (1995) supra ). These subcellular localization data are thus consistent with 
the conclusion that PSTPIP is an actin binding protein that is potentially involved with the regulation of the 
cleavage furrow. 

EXAMPLE 5 - Filopodial Induction by Overexpressed PSTPIP 

1 0 One role that PSTPIP might play in the cleavage furrow is the reorganization of polymerized actin 

(CaoetaUJ. Cell Biol. 111:1905-1911 (1990a), Cao et aL, J. Cell Biol. I 10: 1089-1095 (1 990b), Fishkind et 
aI.,J. CellBioL 1 23 (4): 83 7-848 ( 1993) and Fishkind, (1995) supra). In order to examine the possible function 
of PSTPIP in actin assembly, 3T3 cells were transfected with an epitope tagged version of the protein under 
the control of the powerful cytomegalovirus promoter, and the transfected cells were subsequently examined 

15 for expression of transfected PSTPIP as well as F actin. As can be seen in Fig. 8, 3T3 cells with normal 
morphology which expressed transfected PSTPIP showed co-localization of the protein at the cortical surface 
with F actin as well as in lamellipodial structures and the F actin stress fibers, in agreement with data obtained 
examining endogenous PSTPIP localization (see Fig. 7). Fig. 8 also illustrates that the overexpression of the 
protein often induced a remarkable morphological change in a high percentage of cells expressing it. These 

20 cells contained extended, filopodial-like structures that were filled with polymerized actin. In many cases, the 
structures were up to -150 microns in length, and they often showed a knob-like morphology. In addition, 
the majority of cells contained a single extended filapodial structure. It appears that this structure was 
probably produced in the absence of significant cell growth or plasma membrane synthesis, since the overall 
size of the cell body appeared to decrease dramatically concomitant with the lengthening of the filapodial 

25 structure. This type of cell morphology is never observed with transfection of the green fluorescent protein 
(data not shown), and Fig. 8 illustrates that it is very different from the morphology of normally elongated, 
non-transfected cells. In summary, these results suggest that the unregulated expression of PSTPIP in vivo 
results in the induction of extended ftlopodial-like structures, consistent with the possibility that the protein 
may induce an inappropriate polymerization of the cortical cytoskeleton. 

30 EXAMPLE 6 - N-terminal Deletions in PSTPIP 
Materials and Methods 

Deletional mutagenesis 

Deletions in the PST PIP molecule were made from both the amino terminus and carboxy- 
terminus. The deletions were constructed from pfu-PCR fragments which were ligated back into the original 
35 PST PIP expression vector (Spencer et aL (1 997), supra ). The PCR primers for the N-terminal deletions were 
(all 5* to 3'): 

N-coil.l; CAGTTCGGATCCATGATGCTGCAGAGGCTGCTGGACGGCAGG - Leu26 
(SEQ ID NO: 30) 

N-coiL2; CAGTTCGGATCCATGATGGAGAGGTACGGGAAGGAGCTGGTG - Glu51 
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(SEQ ID NO: 31) 

N-coil.3; CAGTTCGGATCCATGATGTCCTTTGACTCCCTGAAGCAGCAA - Ser 76 
(SEQ ID NO: 32) 

N-coil.4; CAGTTCGGATCCATGATGGAGCTGCGGAGCCTGGAGGAGTTC - GlulOl 
5 (SEQ ID NO: 33) 

N-coil.5; CAGTTCGGATCCATGATGGTCCAGAAGAGCAAGTTGTCGCTC - Alal51 
(SEQ ID NO: 34) 

N-coil.6; CAGTTCGGATCCATGATGGCAGATGATGCTGAGCAGGCCTTC - Ala201 
(SEQ ID NO: 35) 

1 0 Common 3-prime end primer; ACGTCACTCG AGTCACTTGTCATCGTCGTCCTT 
(SEQ ID NO: 36) 

The primers for the C-terminal deletions were: 
C-coil.l; TTGACCTCGAGTCATCACCGCTCAGGGGTGGGAGTCAGAGTC - Arg339 
(SEQ ID NO: 37) 

15 C-coiI.2; TTGACCTCGAGTCATCACAGCCCAGAGAACCTCTTTATCA - Leu314 
(SEQ ID NO: 38) 

C-coil.3; TTGACCTCGAGTCATCAGTCATAGTAGTTCTGATAAGGCACCGGA - Asp289 
(SEQ ID NO: 39) 

C-coil.4; TTGACCTCGAGTCATCAGTCACCTTCCACATCACAGCCCTCAAGGGTC - Asp264 
20 (SEQ ID NO: 40) 

C-coil.5; TTGACCTCGAGTCATCAGGAGAGCTGGTTACAGTGCACCCACAGGGCA - Ser239 
(SEQ ID NO: 41) 

C-coil.6; TTGACCTCGAGTCATCACTCACGCAGGGCCAGGGCCAGCTGGATGTG - GlulOO 
(SEQ ID NO: 42) 

25 The primer N-coiL 1 was used as the common 5-prime PCR primer for the C-terminal deletions. 
In vitro and in vivo analysis o/PSTPIP and PTP HSCF interactions 

In vitro binding analyses between various PTP HSCF and PST PIP constructs were performed as 
previously described (Spencer et al, (1997), supra ). Briefly, plasmids were transcribed and translated in vitro 
using the TnT Rabbit Reticulocyte Lysate System (Promega), * Samples were diluted in 50 mM HEPES, pH 

30 7.2, 1% triton X-100, 10% glycerol, 100 mM NaCl, and 2 ug/ml each of leupeptin, pepstatin, aprotinin and 
PMSF (lysis buffer). Samples were than reacted with GST fusion proteins at various concentrations, and the 
bound proteins were centrifuged using glutathione-sepharose beads and analyzed on SDS poly aery lam ide gels. 
C-terminally-derived peptide inhibition studies were performed by incubating the GST fusion protein binding 
reactions in the presence of 10 ug/ml of the indicated peptides. Peptides were produced by FMOC-protected 

35 amino acids as previously described (Spencer et al, ( 1 997), supra ). In vivo interactions between various forms 
of PTP HSCF and PST PIP were also performed as previously described (Spencer et al, (1997), supra ). 
Briefly, COS cells were transfected with various constructs, and after 48 hours, lysates were prepared and 
immunoprecipitated with antibodies to either the FLAG epitope contained at the C-terminus of PST PIP or 
an HA epitope contained at the N-terminus of PTP HSCF. The resultant blots were probed with anti-FLAG 
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to detect PST PIP, anti-HA to detect PTP HSCF or anti-phosphotyrosine to detect levels of this modified 
amino acid in each protein. 

Confocal microscopy of transfected cells 

Confocal microscopy was performed as previously described (Spencer et al, (1997), supra ), 
5 Briefly, CHO cells in chamber slides were transfected using Lipofectamine and the indicated plasmids. 48 
hours later, cells were fixed in formaldehyde and stained with an anti-FLAG epitope specific antibody 
(Kodak) and Fluorescein-Phalloidin (Molecular Probes). Anti-Flag stained cells were washed and stained with 
Cy3-conjugated sheep anti-mouse IgG. Stained cells were observed using a Molecular Dynamics Confocal 
Microscope (2001) and analyzed with ImageSpace software (Molecular Dynamics). 
10 Results 

PST PIP was originally isolated as a binding partner of the PEST-type PTP, PTP HSCF, in a yeast 
two-hybrid screen. Interestingly, ail of the clones isolated in this procedure began within 10-15 amino acids 
of the N-terminus of PST PIP, consistent with the proposal that the N-terminus was critical for binding to PTP 
HSCF. In order to examine this possibility, deletions of 25 (delta 25), 50 (delta 50) and 75 (delta 75) amino 

15 acids of the N-terminus of the PST PIP coiled-coil domain were created. These deletion mutants were 
produced by in vitro transcription/translation, and they were tested for binding to a GST fusion protein 
containing the C-terminal 149 amino acids of PTP HSCF, including the proline-rich PST PIP binding site 
(GST-PTP HSCF), As figure 9 illustrates, the full length and delta 25 forms of PST PIP were capable of 
interacting with the GST PTP HSCF fusion protein, while the delta 50 and delta 75 forms were not. This lack 

20 of binding might have been due either to a deletion of the actual binding site or to a misfolding of the protein. 
Transfectton of either the full length (Figure 10) or coiled-coil domain of PST PIP results in co-localization 
of the proteins with the cortical actin cytoskeleton and the lamellipodia, an event which presumably requires 
a correctly-folded protein. Thus, the analysis of the cellular localization of mutant forms of PST PIP can be 
utilized as an assay for correct folding of the protein. Figure 10 illustrates that both the wild type and delta 

25 25 forms of PST PIP predominantly co-localized with the cortical actin cytoskeleton, while the delta 50 and 
delta 75 forms of the protein both formed large aggregates within the cytoplasm and showed no cortical 
localization, consistent with the hypothesis that these deletion mutants were improperly folded. These data 
are consistent with the results of the original two-hybrid assay, and they suggest that the N-terminus of PST 
PIP is required for the formation of a correctly folded protein that is capable of binding to PTP HSCF. 

30 Discussion 

The modulation of the tyrosine phosphorylation of a diversity of cellular proteins by protein 
tyrosine phosphatases is a critical aspect of cellular regulation (Neel and Tonks, Opin. Cell Biol, 9(2): 193-204 
(1997). Many of these enzymatic dephosphorylations are mediated by the recognition of phosphotyrosine 
residues by SH2-type domains as well as direct recognition of the substrates by the catalytic domains of the 
35 enzymes (Garton et al, (1996), supra : Saxton et ai, EMBO J. 16(9):2352-2364 (1997)). Here we describe 
a novel mechanism for the regulation of tyrosine phosphorylation which involves the recognition of a proline 
rich motif at the C-terminus of the PTP by a tryptophan-containing site in the cytoskeletal-associated protein, 
PST PIP, which is divergent from the previously described SH3- and WW-type poly-proline binding modules. 
Because this protein-protein interaction appears to be required for the dephosphorylation of PST PIP 



-45- 



WO 98/35037 



PCT/US98/01774 



phosphotyrosines (Spencer et al ( 1 997), supra ), it may be a potentially important new mechanism for the 
regulation of the cytoskeleton. 

The mechanisms utilized by both SH3 and WW domains in recognizing proline-rich helices have 
been elucidated through structure-function analyses using X ray crystallography, NMR and site directed 
5 mutagenesis. The SH3 domain consists of a highly structured 60 amino acid long module which appears to 
fold properly when expressed in the absence of other protein domains, and this short motif is capable of 
binding to proline-rich peptides with relatively high affinity (Terasawa et al., Nat. Struc. Bioi., 1:891-897 

(1994) ; Wittekind et al., J. Mot. Biol. 267(4):933-952; Feng et aL, Proc. NatL Acad. Sci USA 92:12408-12415 

( 1995) ). The WW domain is also a relatively small (-38 amino acids), highly structured motif that is capable 
1 0 of forming an active protein when expressed in the absence of other modules (Macias et aL, Nature 382:646- 

649 (1996)). This is in contrast to the poly-proline recognition sequence found in PST PIP. In this case, 
deletion of the N-terminal 50 amino acids of the protein resulted in an apparently misfolded molecule that 
did not bind to the C-terminal proline-rich domain of PTP HSCF. These data are consistent with the 
possibility that this type of poly-proline recognition domain may require a greater complexity of interactions 
1 5 than either the SH3 or WW modules. 

EXAMPLE 7 -Mutational analysis of PSTPIP 
Materials and Methods 
Mutagenesis 

The mutagenesis of PST PIP was accomplished using the Dut/Ung" procedure (BioRad 
20 Laboratories, Richmond, CA). The mutagenesis primers were designed to change 3 contiguous amino acids 
to alanine. Mutations were spaced approximately 12 amino acids apart, with new restriction sites engineered 
in for identification of mutant clones. Primer annealing was carried out at 70°C for 10 minutes, 37°C for 10 
minutes, room temperature for 5 minutes, and then on ice prior to T7 DNA polymerase addition. The primers 
used for PST PIP alanine-scan mutagenesis were (all 5' to 3'): 
25 D38VE: GTCTGAGGAGCTCCGCCGCAGCCTTGCAC. (SEQ ID NO:43) 

E50ER: CCTTCCCGTACGCCGCCGCCGCCTGAGCTCTCTG (SEQ ID NO: 44) 
R62K: GGCCACCAGCCGCGGCTGCAATCTGCACGAGC (SEQ ID NO: 45) 
R73TS: CAGGGAGTCAAAGGCGGCCGCCAGGGAGTTCATC (SEQ ID NO: 46) 
N84VG: CTGGATGTGCGCGCTGGCCGCAGCCTCTGTTTGC (SEQ ID NO: 47) 
30 R99EE: CCTCCAAGCTTCGCAGCGCCGCAGCCAGGGCCAGGGC (SEQ ID NO: 48) 
Ei 10RQ: CCGCTGCTCTTTCGCTGCCGCTCGGAATTCCTCC (SEQ ID NO: 49) 
1 122MD: CTTCTGGACACGGGCCGCGGCGGCCTCATACTTCT (SEQ ID NO: 50) 
L132YK : GGTCTTCTTGGCGGCCGCAAGCTTGCTCTTCT (SEQ ID NO: 51) 
D145QK: GCATCCCTGCACGCCGCGGCATATAAGCTTTCTTGGACTCCA 
35 (SEQ ID NO: 52) 

El 59RV: GTGGCCATTGGCACTCGCAGCCGCGAAAGCTTGCTCAGCATC 
(SEQ ID NO: 53) 

Ql 69VE: GGCTTTGTTCTGGCTCTTTGCTGCTGCCTTCTGGTGACCATTGGC 
(SEQ ID NO: 54) 
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R 1 94QN: CCTCGCTCTCTCCAGTTGTTCAATAGCTGCCGCGTACACTCT 
(SEQ ID NO: 55) 

W204: CTCCTGCTCCGCCTCGGTCCGAGCTCTCTCC (SEQ ID NO: 56) 
F221: GGATGGTGAGCCGGTCTGCCTCCTGCAGCTGGAGGCC (SEQ ID NO: 57) 
5 L224 : CGGAGGATGGTGGCCCGGTCGAATTCCTGCAACTGG (SEQ ID NO: 58) 
W232: ATGGAGAGCTGGTTACAGTGCACCGCCAATGCATTGCGGAGG 
(SEQ ID NO: 59) 

Cysteines within the region of PTP HSCF binding domain of PST PIP were also mutated to alanine using the 

following primers (all 5' to 3'): 
10 Cys36 : TCCACATCCTTGGACATCTTCCTG (SEQ ID NO: 60) 

Cysl48: CATCATCTGCATCCCTGGCCTTCTGGTCATATCCCTTCTTG (SEQ ID NO:61) 

Cysl80: GGCTGACTCCTTGGACTGCTTGGC (SEQ ID NO; 62) 

Cys2I3: GAAGGCCTCGGAGGTAGTCCGGTGCTC (SEQ ID NO: 63) 

Cys235: ATGGAGAGCTGGTTGGAGTGCACCCACA (SEQ ID NO: 64) 
1 5 Cys242: CATCATCCTTGACGGACTGCATGGAGAGCT (SEQ ID NO: 65) 

Cys259: GATGTCACCTTCCACATCGGAGCCCTCAAGGGTCAG (SEQ ID NO: 66) 

In vitro and in vivo analysis of PSTPIP and PTP HSCF interactions and confocal microscopy 

of transfected cells were performed as described in Example 6. 

Analysis of v-Src mediated PST PIP tyrosine phosphorylation 
20 COS cells were transfected with a constant amount of either the wild type or W232A mutant of PST 

PIP and cotransfected with increasing amounts of a plasmid encoding the v-Src tyrosine kinase (Spencer et 

al (1997), supra ). Plasmid concentrations were equalized with an empty vector containing the CMV 

promoter. After 48 hours, lysates were prepare and immunoprecipitated with anti-FLAG monoclonal 

antibody. Precipitates were run on SDS polyacrylamide gels, blotted, and probed with either anti-FLAG 
25 antibody or anti- phosphotyrosine antibody as previously described (Spencer ei al, (1997), supra ). Bound 

antibodies were visualized using enhanced chemiluminescent reagents. 
Results 

In order to more accurately define the binding site within the PST PIP coiled-coil domain, a 
collection of cluster and point mutations was produced within this domain. Preliminary deletion mapping 

30 suggested that binding of PTP HSCF could be obtained with a form of PST PIP containing amino acids 1-264 
of the coiled coil domain, and the mutagenesis was thus confined to this region of the protein. Because protein 
folding appeared to be critical for the binding of PTP HSCF (Figures 9 and 10), all 6 cysteine residues within 
this part of the protein were mutated to serines, and the resultant mutants were tested for interaction with the 
PTP HSCF GST fusion protein. The elimination of individual cysteine residues did not appear to affect the 

35 binding of these two proteins, suggesting that the protein could fold and function appropriately in the absence 
of individual cysteines (data not shown). Further mutational analysis of PST PIP was thus performed to 
identify a region(s) that was potentially directly involved with PTP HSCF binding. Clustered alanine 
substitutions were produced at approximately 12 amino acid intervals throughout the PST PIP coiled-coil 
domain, and each mutant was subsequently tested for binding to the PTP HSCF GST fusion protein. 
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Mutation of residue clusters L26QR, D38VE, E5OER, R62K, R73TS, N86VG, R99EE, El lORQ, 1 122MD, 
L133YK, D145QK, El59RV,Ql69VE, E184S and R194QN individually to alanine resulted in either no or 
a minor change in the binding activity of these two proteins in vitro (data not shown). In the foregoing 
designations, the subscripted number following the first one letter amino acid code identifies the amino acid 
5 position in SEQ ID NO: 1 where the alanine substitution begins, while the subsequent one letter amino acid 
codes identify the other, congruent amino acids which were replaced by alanine in each individual cluster. 
Figure 1 1 illustrates that the mutation of the tryptophan residue at position 232 of PST PIP to alanine resulted 
in a complete loss of binding to the PTP HSCF GST fusion protein in vitro. Additionally, figure 12 shows 
that co-transfection of wild type PST PIP together with PTP HSCF into COS cells resulted in in vivo 

1 0 association of the proteins, as previously described (Spencer et a/, ( 1 997), supra ), while cotransfection of the 
W232A mutant of PST PIP resulted in a complete lack of in vivo association, consistent with the in vitro 
binding studies. As expected, this mutant non-binding form of PST PIP could no longer be "substrate trapped" 
(imeta/., (1995), supra : Garton et a/., (1996V supra : Flint et aL, (1997), supra : Spencer et al, (1997), supra ) 
by a dominant negative Cys-Ser mutant of PTP HSCF (Figure 12), although it could clearly be tyrosine 

1 5 phosphorylated in the presence of v-Src (Figure 4) or pervanadate (data not shown). Thus, while the wild type 
PST PIP showed enhanced tyrosine phosphorylation in the presence of a dominant negative substrate trapping 
form of PTP HSCF (PTP HSCF C-S)(Jia eta/.. (1995), supra : Garton eta/., (1996), supra : FYmtetaL (1997), 
supra; (Spencer et al. (1997), supra ), the W232A mutant of PST PIP was not hyperphosphoryiated in the 
presence of this mutant form of the enzyme (Figure 12). Because previous results suggested that tryptophan, 

20 aromatic and hydrophobic residues are involved with the recognition of proline-rich domains when found 
appropriately spaced together in the context of other residues in WW-type domains (Macias et aL, (1996), 
supra; Chen et al., (1997), supra ), we examined the PST PIP sequence for these residues near W232. This 
examination revealed that W232 is 27 amino acids C-terminal to another tryptophan at position 205. In 
addition, a phenylalanine (F221) and leucine (L224) residue also occur near the W232 residue with a spacing 

25 that is reminiscent of the WW motif (Andre et aL, Biochem. Btophys. Res. Comm. 205(2): 1201-1205 (1994). 
However, when these residues were mutated to alanine, no effect on PTP HSCF binding was observed in 
vitro (Figure 11). Thus, while the juxtaposition of these two tryptophan residues, together with the 
involvement of W232 in the recognition of the PTP HSCF proline-rich motif, is reminiscent of the WW 
module, comparison of the region containing these nearby tryptophans to the consensus sequence described 

30 for WW-type domains (Andre et aL, ( 1 994), supra )revea\s that most of the conserved residues within the WW 
module are not found in this region of PST PIP (data not shown). In addition, the spacing of the two 
tryptophan residues in PST PIP is somewhat longer than that found for typical WW-type motifs (27 amino 
acids for PST PIP versus -22 amino acids for consensus WW domains). Finally, in order to insure that 
mutation of the critical tryptophan did not result in a global effect on protein folding, as was observed for the 

35 delta 50 and 75 deletion mutants (Figure 10), the W232A mutant was transfected into CHO cells and analyzed 
by confocal microscopy. As Figure 10 illustrates, this mutant protein appears to co-localize with the cortical 
actin cytoskeleton in a manner that is indistinguishable from the wild type protein, consistent with the 
supposition that the W232A mutant was appropriately folded in vivo. These data thus suggest that tryptophan 
residue 232 may be directly involved with the interaction between PST PIP and PTP HSCF. In addition. 
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because W232 does not appear to be embedded in a typical WW module (Andre et al., (1 994), supra ), they 
also suggest that this region defines a novel type of protein -protein recognition motif. 

Previously, we demonstrated that PST PIP was tyrosine phosphorylated when co-transfected with 
v-Src tyrosine kinase (Spencer et al, (1997), supra ). In addition, we showed that this tyrosine phosphorylated 
5 PST PIP was a substrate for dephosphorylation or "substrate trapping" by wild type or dominant negative PTP 
HSCF, respectively, and that the substrate trapping activity required an interaction between the two proteins 
mediated by the C-terminal proline rich region of the PTP (Spencer et al. (1997), supra ). We also established 
that an endogenous tyrosine kinase(s) was capable of phosphorylating tyrosines within PST PIP in both BaF3 
and transfected COS cells, and an endogenous tyrosine phosphatase(s) was capable of dephosphorylating these 

10 tyrosine residues. In addition, preliminary evidence shown in Figure 12 suggested that the W232A mutant 
was more efficiently phosphorylated than the wild type PST PIP in the presence of v-Src. In order to more 
quantitatively examine the role of the W232 residue in v-Src-induced tyrosine phosphorylation, we transfected 
constant amounts of the wild type and W232A mutant forms of PST PIP into COS cells together with 
increasing quantities of the v-Src expression plasmid and subsequently analyzed the levels of phosphotyrosine 

15 in immunoprecipitated PST PIP. Figure 15 illustrates that the W232A mutant form of PST PIP, which was 
deficient in binding to PTP HSCF, was significantly more efficiently tyrosine phosphorylated in the presence 
of v-Src in vivo than the PTP-binding wild type protein, confirming the initial data shown in Figure 12. These 
data are consistent with the hypothesis that PST PIP likely interacts with, and is dephosphorylated by. an 
endogenous PEST-type PTP in COS cells, and the loss of this interaction, as observed with the W232A 

20 mutant, results in enhanced tyrosine phosphorylation of the protein in the presence of v-Src. 
Discussion 

The importance of overall structure to ligand recognition by the WW domain is emphasized by 
mutation of the proline which is C-terminal to the critical tryptophan recognition residue (Chen et al. (1997), 
supra ). Mutation of this residue, which is conserved in all WW motifs, to alanine results in an inactive WW 

25 module, presumably due to a disruption in the fold of the domain. The PST PIP poly-proline recognition 
sequence is missing this highly conserved proline (Spencer et al, (1997), supra ), consistent with the possibility 
that other residues in the protein may be involved with the formation of the ligand binding site. Of potential 
importance is the finding that the region containing the poly-proline recognition sequence in PST PIP is in a 
domain that is predicted to form a coiled coil [31], and preliminary data suggest that this area of PST PIP 

30 mediates dimerization, a characteristic of coiled coil-containing proteins. This, together with the results of 
the N-terminal deletion studies discussed in Example 6, suggests that the overall fold of this relatively 
extended domain might be critical for the formation of a correctly structured poly-proline recognition site. 

While these results suggest that the PST PIP poly-proline recognition domain is functionally and 
structurally divergent from the SH3 and WW modules, an interesting connection between these binding motifs 

35 is the inclusion of a critical tryptophan residue in all three domains. In both SH3 and WW motifs, these 
tryptophans are conserved in all of the modules that have thus far been identified. In the case of both the SH3 
(Feng et ai t Science 266:1241-1247 (1994)) and WW (Chen et al. (1997), supra ) motifs, the tryptophan 
appears to be critical for the interaction with the proline rich peptide, as mutation of this residue results in 
diminished binding. Interestingly, this is also the case for the PST PIP proline-rich recognition site, consistent 



-49- 



WO 98/35037 



PCT/US98/01774 



with the possibility that tryptophan residues are uniquely suited for the recognition of poly-proline rich 
domains. Structural data from both SH3 and WW domains confirms' this hypothesis. In the case of the SH3 
domain, the conserved tryptophan residue is found in the binding pocket, and this residue appears to interact 
by stacking with helically oriented prolines in the proline rich ligand (Feng et al., (1994), sw£rtf;Terasawa et 
5 ai, (1994), supra : Wittekind et al., (1997), supra )). NMR analysis of the WW domain from the yes kinase 
associated protein (YAP) (Sudol et ai, J. Biol. Chem. 270(24): 14733-14741 (1995)) likewise reveals an 
interaction between the conserved tryptophan residue and prolines in the proline helix recognized by this 
protein, although it is also possible that this conserved tryptophan is involved with the structure of the binding 
pocket (Marias, et al., (1 996), supra : Chen et al. (1997), supra ). The fact that mutation of a single tryptophan 

1 0 in the coiled coil region of PST PIP abolishes both in vitro and in vivo binding to the phosphatase is consistent 
with the hypothesis that this tryptophan residue may similarly interact with potentially helically oriented 
prolines in the C-terminus of the PEST PTPs. Alternatively, it is possible that conversion of this hydrophobic 
residue to an alanine results in a misfolding of the protein. However, if the W232A mutant protein is 
improperly folded, it is likely that this is only a localized disruption, since it is still capable of associating with 

15 the cytoskeleton and is tyrosine phosphorylated in the presence of transfected v-Src or the PTP inhibitor, 
pervanadate. Interestingly, the tryptophan located N-terminal to the critical tryptophan involved with binding 
of PST PIP to PTP HSCF does not seem to be required for ligand recognition, a result which is similar to that 
found for the N-terminal tryptophan of the WW domain in YAP (Chen et al. (1997), supra ). Finally, while 
it appears that a number of other residues, particularly with hydrophobic and aromatic side chains, are 

20 involved with the recognition of the proline-rich ligand by both SH3 fl0][8][9] and WW (Marias, et al., 
( 1 996), supra : Chen et al (1997), supra ) domains, mutation of two such residues in PST PIP (F22 1 and L224) 
does not have a significant effect on binding, consistent with the supposition that the poly-proline recognition 
domain of PST PIP is divergent from the WW module. 

The potential importance of W232 in the function of PST PIP is underlined by the finding that 

25 expression of the W232A mutant in COS cells together with the v-Src tyrosine kinase results in an enhanced 
tyrosine phosphorylation of the cytoskeletal associated protein. These data are consistent with the hypothesis 
that PST PIP interacts with endogenous PTPs in vivo, and this interaction mediates the removal of phosphates 
from tyrosine residues. Furthermore, because this mutation blocks the binding of the PEST-type PTP HSCF 
via the C-terminal proline rich domain, these results suggest that it is probable that PST PIP interacts with one 

30 or more endogenous PEST-type tyrosine phosphatases in COS cells. However, the question remains as to why 
the W232A mutant is not constituitively tyrosine phosphorylated in the absence of v-Src, since it is likely that 
the protein is unable to efficiently bind endogenous PEST-type PTPs. While it might be argued that the 
appropriate tyrosine kinase is not present in COS cells, we have previously demonstrated that the protein is 
tyrosine phosphorylated in the presence of vanadate, both in in its endogenous state in BaF3 cells as well as 

35 when it is transfected into COS cells (Spencer et al, (1997), supra ). A likely explanation for these data is that 
the kinase that phosphorylates PST PIP requires an activation event, such as tyrosine phosphorylation, to 
mediate this modification. Thus, v-Src, which is a constituitively activated tyrosine kinase [33], would be 
predicted to mediate the tyrosine phosphorylation of the W232A mutant in the absence of vanadate. In 
addition, the data suggests that vanadate must activate an endogenous tyrosine kinase(s), presumably by 
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inhibiting an endogenous tyrosine phosphatase (Jia et al., (1995), supra ')), which subsequently mediates the 
tyrosine phosphorylation of PST PIP. 
EXAMPLE 8 -Mutational analysis of PTP HSCF 
Materials and Methods 
5 Mutagenesis 

Single codon mutations to alanine were made in the carboxy-terminus of PTP HSCF following the 
procedure described in Example 7, and using the following primers (all 5' to 3'): 
HSCF R436: CCCTTTGGGTCGACCGATGGCCAAGTTGAAGCC (SEQ ID NO:67) 
HSCF P440: AGGATCTCGGGGCCCTTTGGCCCTTCCGATGCGC (SEQ ID NO: 68) 
10 HSCF G442: CTGGAGGATCTCGAGGTGCTTTGGGCCTTCC (SEQ ID NO: 69) 
HSCF P443: GGAGGATCTCGGGCCCCTTTGGGCCTTCCG (SEQ ID NO: 70) 
HSCF R444: CTGCAGGAGGATCCGCGGGCCCTTTGGGCCTTCC (SEQ ID NO: 71) 
HSCF P447: GTCCACTCTGCAGCAGGATCCCGGGGCCCTT (SEQ ID NO: 72) 
HSCF W450: GTTACACCCGTGTCGCCTCTGCAGGAGGATCCCG (SEQ ID NO: 73) 
1 5 The R444+W45O double mutant was made with the HSCF W45O primer on a single-strand template of the 
R444 mutant phosphatase. The mutants were all confirmed by DNA sequencing. 

The other techniques were performed as described in the previous examples. 

Results 

In order to analyze the residues within the C-terminal 20 amino acid region of PTP HSCF that were 

20 critical for PST PIP binding. 20 amino acid peptides with alanines incorporated separately at each position 
were tested for blocking of the interaction in vitro. Previously we demonstrated that a 20 amino acid peptide 
derived from this region of three different PEST-type PTPs (Yang et al, J. Biol. Chem. 268(23): 17650 (1993); 
Matthews et al. t (1992) . supra : Cheng et al, (1996), supra }) was capable of efficiently blocking the binding 
of an in vitro translated form of PST PIP to a GST fusion of PTP HSCF containing the C-termina! 149 amino 

25 acids, including the C-terminal protine-rich binding site (GST PTP HSCF) (Spencer et al. (1997), supra). 
Figure 13 illustrates that alanine replacement of R436, P440, G442, P443, R444, P447. and W450 individually 
in a peptide derived from the C-terminus of PTP HSCF resulted in a decreased inhibition of binding by the 
mutant peptides, while alanine replacement at the other sites within the peptide had little or no effect on the 
ability of these peptides to block the interaction in vitro. Importantly, these residues are conserved in all of 

30 the PEST-type PTP C-termini (Yang et al, (1993), supra : Matthews et al., (1992) . supra : Cheng et al., (1996), 
supra ): Kim et al. Oncogene 13:2275-2279 (1996)), consistent with previous data (Spencer et al (1997), 
supra ) demonstrating that peptides derived from the other members of this family of phosphatases all 
effectively blocked this interaction (Figure 13). To confirm the peptide mutation analysis, each residue found 
to be critical for PST PIP binding in the PTP HSCF C-terminal region was mutated to alanine in the context 

35 of the whole protein, and the ability of each mutant PTP to bind PST PIP was analyzed in vitro and in vivo. 
Figure 14 illustrates that mutants of the phosphatase containing alanines at all but one of the positions 
predicted from the peptide mapping study (P443, Figure 13) were substantially deficient in binding to GST 
PST PIP in the in vitro binding assay, although 10 fold increased amounts of GST PST PIP could interact with 
the mutant PTP HSCF proteins, suggesting only a partial loss of binding. In addition production of a double 
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mutation in two of the critical residues in this region of PTP HSCF (R444 and W45O) resulted in a stronger 
inhibitory effect on binding to PST PIP (Figure 14). In vivo analysis of these point mutants revealed only a 
modest effect on binding in the single mutants of PTP HSCF, consistent with the in vitro data suggesting that 
sufficiently high levels of PST PIP could interact with the mutant proteins. However, as was observed in the 

5 in vitro experiments, the doubly mutated form of PTP HSCF (R444+W45O) was as poor at interacting in vivo 
with PST PIP as the mutant which was missing the entire C-terminai proline rich domain (PTP HSCF D24) 
(Spencer et ai, ( 1 997), supra) . These data confirm the importance of these residues to the binding interaction, 
and they suggest that much of the C-terminal region of PTP HSCF may be required for highest affinity 
binding to PST PIP. 

] 0 Discussion 

Mutational analysis of the proline rich domain of PTP HSCF is compatible with the proposal that 
the PST PIP binding site is a novel poly-proline recognition module. These data demonstrated that the binding 
site in the phosphatase appeared to stretch over a length of approximately 15 amino acids, from R4 3 6 to 
W450- This is in contrast to structural studies on SH3 and WW domain recognition sites, where mutagenesis, 
15 X ray crystallography and NMR analyses have demonstrated that stretches of 10-12 (Feng et ai, (1994), 
5wg£a;Terasawa et ai. (1 994), supra : Wittekind et ai, (1997), supra X) or 6 (Macias, et ai, (1996), supra; Chen 
et ai (1997), supra ) residues, respectively, are required for the highest affinity interaction. In addition, while 
the PST PIP poly-proline recognition domain appears most like the WW module in that it contains two 
relatively closely spaced tryptophans, the ligands recognized by the WW motif have been found to have the 
20 general structure. XPPXY, with both prolines and the tyrosine performing critical recognition functions (Chen 
et ai, (1995), supra : Einbond et al., FEBS Letts. 384: 1-8 (1996); (Macias, et al., (1996), supra; Pirozzi et al., 
J. Bioi Chem. 272(23)1461 1-14616 (1997). The PTP HSCF poly-proline region contains two adjacent proline 
residues, one of which was found to be involved with binding, but there is not a tyrosine residue C-terminal 
to the second proline. While these results suggest that the recognition of the PTP HSCF poly-proline domain 
25 by PST PIP is quite different from the mechanisms utilized by SH3 and WW modules, a striking similarity 
is found in the involvement of closely spaced proline residues. Similar requirements have been found for the 
prolines in SH3 recognition sites (Feng et al, (1994), supra , while mutation of these residues in WW 
recognition sites, while not quantitatively measured, also showed an effect on binding (Chen et al. (1997), 
supra X Again, structural analysis of SH3 and WW modules bound to their cognate ligands illuminates the role 
30 of these prolines in binding. In the case of both of these motifs, the ligand adopts a type II poly-proline helical 
conformation which allows for interactions between residues within the helical region and conserved side 
chains within the recognition modules (Feng et ai, (1994), supra : Feng et ai t Proc. Natl. Acad. Sci. USA 
92: 12408-12415 (1995)). Because mutation of the prolines in the PST PIP C-terminal region resulted in an 
effect on binding, it is likely that this region may also form a type II proline helix which disposes the relevant 
35 side chains in the appropriate conformation. In addition, and in contrast to the SH3 and WW recognition 
motifs, the glycine contained within this region also appears to be involved with binding to PST PIP. Because 
glycine residues are also mediators of peptide structure, it is possible that this residue may serve to fold this 
small region into an appropriate conformation, and it may be this high concentration of structure inducing 
residues that allows this diminutive peptide to bind to PST PIP so efficiently (Spencer et ai (1997), supra). 
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Significantly, mutagenesis studies have also revealed the importance of non-proline residues in the binding 
of both SH3 and WW motifs to poly-proiine ligands. In the case of the WW domain, the conserved tyrosine 
residue of the ligand is important for the interaction and makes a direct contact with the binding module 
(Macias, et al., (1996), supra ), while amino acids in the N- or C-terminal regions of the SH3 recognition site 
5 can determine the orientation and affinity of binding of the peptide ligands (Feng et al. t (1995), supra )). 
Because mutation of the arginines contained within the PTP HSF C-terminus had an effect on binding, it is 
possible that electrostatic interactions are involved with the binding event, as has been observed for SH3 
recognition modules (Feng et ai, (1995), supra )). The importance of the C-terminal tryptophan suggests the 
possibility of hydrophobic stacking interactions, perhaps with the important tryptophan residue in PST PIP. 
10 Finally, because the residues involved with binding are highly conserved (Cheng et al., (1996), supra )), these 
data are completely consistent with previous studies demonstrating that C-terminal proline-rich peptides 
derived from the C-termini of the related PTPs PEST, PEP (Spencer et al (1997), supra ) and BDP-1 
effectively block the interaction between PTP HSCF and PST PIP. 
M. Summary 

1 5 We have isolated a novel member of the actin binding protein family, PSTPIP, which binds to the 

PEST tyrosine phosphatases via an interaction between the proline rich C-terminal homology domain of the 
PTP and the coiled coil domain of the interacting protein. Like many other proteins associated with the 
cytoskeleton, PSTPIP is tyrosine phosphorylated in V-src transfected cells, and at least a subset of these 
phosphorylated residues appear to be substrates for the catalytic site of the bound PTP HSCF. PSTPIP is 

20 localized to the cortical cytoskeleton, as well as in iamellipodia and on stress fibers, and it appears to migrate 
to the actin-rich cleavage furrow during cytokinesis. Overexpression of the protein in 3T3 cells induces long 
filopodial structures, consistent with a role for PSTPIP in the reorganization of the cytoskeleton. These data 
demonstrate that PSTPIP is a cytoskeletal binding protein whose physiological function is, in part, regulated 
by its degree of tyrosine phosphorylation. 

25 Analysis of the protein database for sequences with homology to PSTPIP suggests potential 

functions for this novel protein. Most of the sequences with significant homology to PSTPIP fall into the actin 
binding family of proteins, and it is clear from the confocal studies reported here that PSTPIP interacts with 
actin. While a number of other actin binding type proteins, including myosin, fodrin and spectrin, show 
homology to PSTPIP, the bulk of these homologies are within the SH3 domain, with little or no match in other 

30 regions of the protein. This is also true for another protein which binds to the actin cytoskeleton in a similar, 
but not identical, manner, p80/85 cortactin (Wu et al., (1991) supra ), although there is weak homology in a 
small region of the coiled coil domain as well as the SH3 region. This is in contrast to the protein with the 
greatest degree of homology, the yeast S.pombe cdcl5p, which shows significant sequence conservation in 
both the SH3 as well as the coiled coil domains (Fankhauser et aL, (1995) supra ). Cdc 15p is a highly 

35 phosphorylated protein which is absolutely required for the formation of the actin ring at the cleavage furrow 
of the post-mitotic cell, and mutations in this protein result in an inability to assemble the actin ring over the 
postmitotic nucleus, thus resulting in multi-nucleate cells. As with PSTPIP, cdc!5p is localized to the cortical 
actin cytoskeleton until anaphase, when it migrates over the postmitotic nucleus and presumably mediates the 
reorganization of the cytoskeleton to the cleavage plane (Fankhauser et al„ (1995) supra . Chang et al.. Cell 
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84:191-194 (1996) and Simanis, Sem. in Cell Biol 6:79-87 (1995)). While the timing of PSTP1P migration 
to the cleavage furrow remains to be determined, its striking co- localization with the actin ring at this site 
during cytokinesis is analogous to what is observed with cdclSp (Fankhauser et al., (1995) supra V In addition, 
the cdcl5p is hyperphosphorylated until the onset of anaphase and the formation of the F actin cytokinetic 

5 cleavage ring, when it becomes significantly dephosphorylated. Interestingly, the yeast protein regains its high 
state of phosphorylation at the conclusion of cell division, suggesting that phosphorylation regulates its 
association with the cleavage furrow. While the type of phosphorylation of cdcl5p has not yet been analyzed, 
this suggests that tyrosine and/or serine threonine phosphatases must be involved with the regulation of the 
function of cdc 1 5p, and provides a mechanism whereby the binding and catalytic activity of a PTP such as 

10 PTP HSCF might function to control cytokinesis. Again, while the timing of tyrosine phosphorylation of 
PSTP1P during the cell cycle has yet to be determined, both the exact conservation of 5 tyrosine residues 
between PSTP1P and cdcl5p as well as the vanadate sensitive tyrosine phosphorylation of endogenous PTP 
interacting protein in Baft cells are suggestive of modulation of phosphotyrosine levels during the cell cycle- 
Thus, the sequence, cellular localization, and phosphorylation of both PSTPIP and cdc 15 suggest that the 

15 mammalian protein is a potential homologue of cdcl5p. 

Phosphorylation, especially of serine and threonine residues, has been previously shown to play 
important roles in regulating events in cytokinesis and reorganization of the cytoskeletal (Yamakita et al., J. 
Cell Biol. 124: 129-137 (1994), Egelhoff et al., CW/ 75:363-371 (1993) and Fishkind et aL (1995) surjra). To 
date, however, the possibility that tyrosine phosphorylation may play a role in these functions has been 

20 incompletely examined. The data reported in this paper demonstrate that the regulation of tyrosine 
phosphorylation on PSTPIP by PTP HSCF may play a role in aspects of cytoskeletal control including, 
possibly, cytokinesis. While the possible kinases involved in such phosphorylation are numerous, the 
information described here as well as elsewhere suggests that a member of the Src family of tyrosine kinases 
may be involved with the phosphorylation of this interacting protein by either direct or indirect mechanisms. 

25 Two other PSTPIP-related proteins, p80/85 cortactin and the HS1 protein, are both known to be tyrosine 
phosphorylated in V-src transformed cells, and cortactin is known to interact with the cytoskeleton in a manner 
similar to PSTPIP (Wu et al., (1991) surjra). In addition, a plethora of other proteins which are involved with 
the cytoskeleton are also tyrosine phosphorylated in V-src transformed cells (Schaller et al., Prog. Nuc. Acid 
Res. and Mol. Biol. 44:205-227 (1993). Interestingly, the -tyrosine phosphorylation of cortactin is also 

30 dramatically enhanced in cells isolated from mice deficient in the Csk kinase (Thomas et al., (1995) supra), 
a tyrosine kinase which phosphorylates the C-terminal inhibitory tyrosine on C-src, suggesting that cortactin 
is either a direct or indirect C-src substrate in vivo. In addition, it has been demonstrated that HS 1 can bind 
to the SH3 and SH2 domains of Src in vitro, and it is also tyrosine phosphorylated by this kinase in vitro and 
in vivo (Takemoto et al., (1996) supra ). Although only distantly related to cortactin and HS1, the tyrosine 

35 phosphorylation of PSTPIP by V-src in transfected cells may therefore be of physiological relevance. 

In addition, previous data have demonstrated that C-src associates with the focal adhesions and 
lamellipodia, as well as other actin-containing sites, consistent with the possibility that it could phosphorylate 
PSTPIP, which also localizes to these regions (Kaplan et al., (1994) supra ). Finally, V-src is known to induce 
cytoskeletal changes in transformed cells, and it has been clearly shown that cortactin, an actin binding protein, 



-54- 



WO 98/35037 



PCT/US98/01774 



becomes reoriented from the ends of the stress fibers to the podosomes of these Src-transformed ceils, 
consistent with the possibility that phosphorylation of such actin binding proteins might mediate changes in 
their cellular localization (Wu et al., ( 1 99 1 ) sjnara). 

The use of dominant negative forms of PTPs has been previously utilized to identify substrates 
5 for several enzymes, most notably PTP PEST (Garton et al. (1996) supra ') and the corkscrew PTP (SH PTP-2) 
(Herbst et al., (1996) supra ). In general, these studies have demonstrated that these dominant negative mutants 
enhance the tyrosine phosphorylation of a surprisingly limited number of substrates in vivo, in contrast to the 
relatively promiscuous behavior of these enzymes in vitro. The demonstration here that co-expression of two 
different dominant negative forms of PTP HSCF mediates a dramatic increase in V-src induced PSTPIP 

10 tyrosine phosphorylation is thus consistent with several conclusions. The first is that these two proteins 
interact intimately in vivo, probably through the C-terminal homology domain and the coiled coil region 
interaction determined from the in vitro binding studies, and the co-precipitation analysis (Fig. 6) supports 
such a physical interaction. This then provides yet another example of the use of a non-catalytic region by 
a PTP to bring the catalytic domain in close proximity to the substrate, although the binding mechanism 

1 5 utilized in this case is novel (Tonks. (1993) sugra). The second is that it is likely that tyrosine phosphorylated 
PSTPIP is an in vivo substrate for the PTP HSCF and it suggests that the enzyme inhibited by vanadate in the 
endogenous phosphotyrosine experiment in Baf3 cells, where both PSTPIP and PTP HSCF are expressed, is 
likely to be PTP HSCF. Finally, if we assume that the mutant forms of PTP HSCF are endowed with the same 
degree of substrate specificity that has been found with other dominant negative PTPs, than the V-src co- 

20 transfection studies further suggest that either Src or a related family member may be a kinase which is 
involved with the tyrosine phosphorylation of PSTPIP in vivo in non-transfected cells. 

The nature of the high affinity binding between the proline rich C-terminai homology domain and 
the coiled coil region is reminiscent of that previously described for the SH3-proline rich core interaction 
(Pawson, (1995) supra ). In this latter case, proline helices induce the formation of highly structured small 

25 peptide domains that bind with relatively high affinity and specificity to the binding pocket of the SH3 
domain, and various interactions, including salt bridges, mediate the specificity and direction of peptide 
binding (Feng et al., (1995) supra ). Analysis of the proline rich C-terminal homology domains of three PEST 
PTPs, all of which appear to inhibit the PSTPIP-PTP HSCF binding interaction with similar IC50s, reveals 
that they share a proline rich core region that would be predicted to form a proline helix similar to that seen 

30 for SH3 binding sites (Yang et al., (1993) supra . Matthews et ah, (1992) supra and Cheng et al., (1996) supra ). 
This region contains a number of charged residues, and it is possible that the potential helical nature of this 
domain positions these residues in an appropriate binding conformation for interaction with a site within the 
coiled coil domain. Because ail of the PEST PTPs are predicted to bind to PSTPIP via this proline rich region, 
it is possible that the interacting protein's phosphotyrosine content is modulated by different PEST PTPs in 

35 different cell types. Along these lines, it is interesting to note that the only hyperphosphorylated protein 
observed in COS cells transfected with dominant negative (D-A) PTP PEST was p 1 30 cas (Garton et al., ( 1 996) 
supra ). This results suggest that, if PSTPIP is expressed in COS cells, it is either not tyrosine phosphorylated 
or is not a substrate for this PTP in this cell line. The mechanism by which PSTPIP migrates from the cortical 
actin, lamellipodia and stress fiber regions in resting cells to the cytokinetic cleavage furrow in dividing cells 
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can only be speculated upon (Strome, Cell 72:3-6 (1993). One possibility is that this protein binds tightly to 
actin, and when the actin is reoriented to the cleavage plane, the PSTPIP accompanies it passively (Cao et al., 
( 1 990a) supra . (Cao et al., ( 1 990b) supra and Fishkind. ( 1 993) supra). However, experiments in yeast where 
cdclSp is deleted revealed that cortical actin did not migrate to the cleavage plane in the absence of this 
5 protein, suggesting that cdc 1 5p actively traverses to this site and mediates the assembly of the actin ring 
(Simanis (1995) supra). These data thus suggest that if PSTPIP is a mammalian homologue of cdcl 5p, that 
dominant negative mutants in this protein should abolish the assembly of actin at the cleavage furrow. 
Interestingly, it appears that deletion mutants of cdcl5p which lack the SH3 domain are incapable of rescuing 
the cdcl 5 mutants, suggesting a critical role for this C-terminal domain in assembling the cytokinetic actin ring 

1 0 (Fankhauser et ah, ( 1 995) supra ). 

A possible mechanism by which PSTPIP functions is suggested by the results of overexpression 
studies in murine 3T3 cells. The extended filopodial structures in many of these transfected cells are consistent 
with the possibility that the unregulated expression of the protein mediates an ectopic and organized assembly 
of actin filaments, thus resulting in a cellular protrusion containing PSTPIP and F actin. In this regard, the 

15 striking level of lysines in the predicted coiled coil domain of this protein is consistent with previously 
described actin binding sites (Vandekerckhove. Curr. Opin. Cell Biol. 2:41-50 (1990) and Friederich et al., 
Cell 70:81-92 (1992)). Interestingly, many of the transfected cells contained a single filopodial-like structure, 
suggesting that this morphological feature is rapidly formed and is likely to have a negative influence on cell 
viability. The apparent small size of many of these cells suggests that this actin-containing spike is formed 

20 in the absence of plasma membrane synthesis, also consistent with a rapid formation of the structure. The 
apparent heterogeneity in penetrance of this morphological entity may be due either to expression levels or 
differences in post-trans lational modifications of the transfected proteins. Thus, it would appear that PSTPIP 
may play a role in the rapid assemble of a highly organized F actin containing structure. 
EXAMPLE 9 

25 Expression of PSTPIP in E. coli 

This example illustrates preparation of an unglycosylated form of PSTPIP by recombinant 
expression in E. coli. 

The DNA sequence encoding PSTPIP (SEQ ID NO:2) is initially amplified using selected PCR 
primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites 

30 on the selected expression vector. A variety of expression vectors may be employed. An example of a 
suitable vector is pBR322 (derived from E. coli: see Bolivar et ah, Gene . 2:95 (1977)) which contains genes 
for ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and 
dephosphorylated. The PCR amplified sequences are then ligated into the vector. The vector will preferably 
include sequences which encode for an antibiotic resistance gene, atrp promoter, a polyhis leader (including 

35 the first six STII codons, polyhis sequence, and enterokinase cleavage site), the PSTPIP coding region, lambda 
transcriptional terminator, and an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described 
in Sambrook et al., supra . Transformants are identified by their ability to grow on LB plates and antibiotic 
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resistant colonies are then selected. Ptasmid DNA can be isolated and confirmed by restriction analysis and 
DNA sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented 
with antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells 
5 are then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The 
cell pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the 
solubilized PSTPIP protein can then be purified using a metal chelating column under conditions that allow 
tight binding of the protein. 
10 EXAMPLE 10 

Expression of PSTPIP in mammalian cells 

This example illustrates preparation of a glycosylated form of PSTPIP by recombinant expression 
in mammalian cells. 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression 
1 5 vector. Optionally, the PSTPIP DNA is ligated into pRK5 with selected restriction enzymes to allow insertion 
of the PSTPIP DNA using ligation methods such as described in Sarnbrook et al., supra . The resulting vector 
is called pRK5-PSTPIP. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) 
are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum 

20 and optionally, nutrient components and/or antibiotics. About 10 ug pRK5-PSTPIP DNA is mixed with about 
1 ug DNA encoding the VA RNA gene [Thimmappaya et al., Cejl, U_:543 (1982)] and dissolved in 500 ul 
of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 ul of 50 mM 
HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 
25°C The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 

25 37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 
cells are then washed with serum free medium, fresh medium is added and the cells are incubated for about 
5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 uCi/ml 35 S-cysteine and 200 uCi/mi 35 S- 

30 methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, 
and loaded onto a 15% SDS gel. The processed gel may be dried and exposed to film for a selected period 
of time to reveal the presence of PSTPIP polypeptide. The cultures containing transfected cells may undergo 
further incubation (in serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PSTPIP may be introduced into 293 cells transiently using the dextran 

35 sulfate method described by Somparyrac et al., Proc. Natl. Acad. ScL . 12:7575 (198 1). 293 cells are grown 
to maximal density in a spinner flask and 700 ug pRK5-PSTPIP DNA is added. The cells are first 
concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate 
is incubated on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed 
with tissue culture medium, and re-introduced into the spinner flask containing tissue culture medium, 5 ug/ml 
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bovine insuiin and 0.1 ug/ml bovine transferrin. After about four days, the conditioned media is centrifuged 
and filtered to remove cells and debris. The sample containing expressed PSTPIP can then be concentrated 
and purified by any selected method, such as dialysis and/or column chromatography. 

In another embodiment, PSTPIP can be expressed in CHO cells. The pRK5-PSTPIP can be 
5 transfected into CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the 
cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing 
a radiolabel such as 3 5 S- methionine. After determining the presence of PSTPIP polypeptide, the culture 
medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days, 
and then the conditioned medium is harvested. The medium containing the expressed PSTPIP can then be 
1 0 concentrated and purified by any selected method. 

Epitope-tagged PSTPIP may also be expressed in host CHO cells. The PSTPIP may be subcloned 
out of the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag 
such as a poly-his tag into a Baculovirus expression vector. The poly-his tagged PSTPIP insert can then be 
subcloned into a SV40 driven vector containing a selection marker such as DHFR for selection of stable 
1 5 clones. Finally, the CHO cells can be transfected (as described above) with the SV40 driven vector. Labeling 
may be performed, as described above, to verify expression. The culture medium containing the expressed 
poly-His tagged PSTPIP can then be concentrated and purified by any selected method, such as by Ni 2+ - 
chelate affinity chromatography. 
EXAMPLE 11 
20 Expression of PSTPIP in Yeast 

The following method describes recombinant expression of PSTPIP in yeast. 
First, yeast expression vectors are constructed for intracellular production or secretion of PSTPIP 
from the ADH2/GAPDH promoter. DNA encoding PSTPIP, a selected signal peptide and the promoter is 
inserted into suitable restriction enzyme sites in the selected plasmid to direct intracellular expression of 
25 PSTPIP. For secretion, DNA encoding PSTPIP can be cloned into the selected plasmid, together with DMA 
encoding the ADH2/GAPDH promoter, the yeast alpha-factor secretory signal/leader sequence, and linker 
sequences (if needed) for expression of PSTPIP. 

Yeast ceils, such as yeast strain AB1 10, can then be transformed with the expression plasmids 
described above and cultured in selected fermentation media. The transformed yeast supernatants can be 
30 analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining 
of the gels with Coomassie Blue stain. 

Recombinant PSTPIP can subsequently be isolated and purified by removing the yeast cells from 
the fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. 
The concentrate containing PSTPIP may further be purified using selected column chromatography resins. 
35 EXAMPLE 12 

Expression of PSTPIP in Baculovirus 

The following method describes recombinant expression of PSTPIP in Baculovirus. 
The PSTPIP is fused upstream of an epitope tag contained with a baculovirus expression vector. 
Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). A variety of 
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plasmids may be employed, including plasmids derived from commercially available plasmids such as 
pVL1393 (Novagen). Briefly, the PSTPIP or the desired portion of the PSTPIP (such as the sequence 
encoding the extracellular domain of a transmembrane protein) is amplified by PCR with primers 
complementary to the 5* and 3' regions. The 5 1 primer may incorporate flanking (selected) restriction enzyme 
5 sites. The product is then digested with those selected restriction enzymes and subcloned into the expression 
vector. 

Recombinant baculovirus is generated by co-trans fecting the above plasmid and BaculoGold™ 
virus DNA (Pharmingen) into Spodoptera frugiperda ("Sf9 n ) cells (ATCC CRL 1711) using lipofectin 
(commercially available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C. the released viruses are 
1 0 harvested and used for further amplifications. Viral infection and protein expression is performed as described 
by O'Reilley et al., Baculovirus expression vectors: A laboratory Manual, Oxford: Oxford University Press 
(1994). 

Expressed poly-his tagged PSTPIP can then be purified, for example, by Ni 2+ -chelate affinity 
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 

15 Rupert et al.. Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% Glycerol; O.I%NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicaies are cleared by centrifugal ion, and the supernatant is diluted 50-fold 
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% Glycerol, pH 7.8) and filtered through a 0.45 urn 
filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 

20 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract 
is loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 2 go with loading buffer, 
at which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 
phosphate; 300 mM NaCl, 10% Glycerol, pH 6.0), which elutes nonspeciflcally bound protein. After reaching 
A 280 Dase » ne again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 

25 buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted HisjQ-tagged PSTPIP 
are pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PSTPIP can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 

30 N. Concluding Remarks : 

The foregoing description details specific methods which can be employed to practice the present 
invention. Having detailed such specific methods, those skilled in the art will well enough known how to 
devise alternative reliable methods at arriving at the same information in using the fruits of the present 
invention. Thus, however, detailed the foregoing may appear in text, it should not be construed as limiting 

35 the overall scope thereof; rather, the ambit of the present invention is to be determined only by the lawful 
construction of the appended claims. All documents cited herein are expressly incorporated by reference. 
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